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EXPERIMENTS IN LEADERSHIP TRAINING FOR 


DECISION-MAKING 


DEAN C. 


I I has been estimated that the number 
of formal organizations in the United 


States is greater than the number of 


individuals in our total population. 


These organizations, diverse as they may 
be in purpose and membership, employ 
remarkably similar methods for conduct- 
ing their affairs and adjusting to extern- 
al situations. Probably no single method 
of arriving at group judgments is more 
widely that of the decision- 


used than 


making discussion group, or the “com- 


mittee.” To many participants in or- 


ganizational activities, the recognition 


ofa problem situation and the appoint- 


ing of appropriate committees must 
seem to occur as almost simultaneous 
events. 


A large body of experimental evidence 
justifies this widespread confidence in 
group discussion. It has been established 
that 
group judgments are olten superior to 


on fairly substantial grounds 
individual judgments, that groups can 
explore a greater range of suggestions 
than can individuals, and that partici- 
pation in organizational decisions can 
be psychologically satisfying to group 
members. 


In spite of these theoretical advan- 
*Based upon Ph.D. dissertation, Northwestern 


University, 1951, directed by Kenneth G. Hance 
and the late Robert E. Seashore. 
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tages, there is considerable empirical 
evidence to the contrary. Both partici- 
pants and observers have repeatedly em- 
phasized that in practice discussion is 
often time-consuming and energy-con- 
suming, that committees are often super- 
ficial in their investigation, and that 
group deliberations are often distorted 


by emotional crises of 


one kind or an- 
other. All of this would suggest that 
some bridge must be found between 


the theoretical possibilities of discussion 
and the abuses accompanying the every- 
day application of the method. If an 
effective catalyst can be discovered, the 
joining of potential and practice may be 
brought about. 

The fecal position of the leader of a 
discussion group suggests that better per- 
formance of committee assignments may 
result from improved leadership. While 
the chairman is not completely or solely 
the 


group, it is apparent that he occupies 


responsible for behavior olf 


any 
a position of considerable influence, and 
that the manner in which he exercises 
his leadership probably has a great deal 
to do with the group’s effectiveness in 
fulfilling its. responsibilities. It would 
seem desirable, therefore, to apply the 
techniques of scientific experimentation 
to the training of discussion leaders as 
the most direct 


way of securing im- 
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provement in discussion practice. In- 
creased understanding of the problems 
of group leadership should also be of 
practical value for those who are re- 
sponsible for teaching discussion meth- 
ods. Accordingly, the following objec- 
tives were adopted for this investigation: 
theoretical rationale for 
the experimental study of group leadership. 


1. To formulate a 


2. To discover some of the essential functions 
of leadership in decision-making discussion 
groups. To accomplish this objective it was 
considered desirable: 

a. To determine the types of behavior 
which characterize the successful leader 
of decision-making groups. 

b. To determine the types of leadership 
interactions required by decision-making 
groups in the actual 
their duties. 


performance of 


3. To determine the effects of training on the 
performance of leadership functions in de- 
cision-making discussion groups. To ac- 
complish this objective it was necessary: 

a. To select a method by which persons 
of different levels of leadership ability 
could be chosen. 

b. To construct and evaluate a rating scale 
for measuring ability to perform leader- 
ship functions. 

c. To prepare an_ instructional manual 
which could be used for training leaders. 

d. To develop training procedures for im- 
proving skill in group leadership. 


I. EXPERIMENTAL RATIONALE 


The initial task facing the student of 
leadership is to construct a conceptual 
framework for his study. In most scien- 
tific areas some clues will be found in 
the related literature. A review of the 
literature on leadership, however, re- 
vealed that leadership has been studied 
extensively, but that most of the experi- 
mental work prior to the last decade has 
been devoted solely to the identification 
of the “personal traits” of established 
leaders. In general, long lists of ambigu- 
ous and sometimes contradictory traits 
have resulted, without creating any 
satisfactory basis for evaluating, select- 


ing, or training group leaders.1 These 
unsatisfactory results suggested that no 
more reliable or useful scientific data 
for training group leaders were likely 
to be obtained until a more valid theo- 
retical hypothesis could be formulated. 


Under the impetus of the theoretical 
concepts of Kurt Lewin and J. L. Mor- 
eno, there has been, during the past 
decade, a renewed effort to study leader- 
ship in face-to-face groups. Some of these 
studies have investigated the effects of 
different styles of leadership on group 
behavior,? some have related leadership 
techniques to situational factors, and 
still others have evaluated methods of 
selecting leaders. The possibilities of 
training and re-education have been ex- 
plored with highly suggestive results by 
several investigators. Jack and Page, 
working with preschool children, dis- 
covered that ascendant behavior is large- 
ly dependent upon situational skill and 


confidence. Alex Bavelas, in training 


adult recreational leaders, found that it 
was possible to modify leadership meth- 
ods so as to improve group participa- 


1See H. L. Smith and L. M. Krueger, “A 
Brief Summary of the Literature on Leader- 
ship,” Bulletin of the School of Education, 
Indiana University, IX (1933); W. O. Jenkins, 
“A Review of Leadership Studies with Particu- 
lar Reference to Military Problems,” Psycho- 
logical Bulletin, XLIV_ (1947), 54-79; R. M. 
Stogdill, “Personal Factors Associated with 
Leadership: A Survey of the Literature,” Jour. 
of Psychology, XXV_ (1948), 35-71. 

2R. Lippitt, “An Experimental Study of the 
Effect of Democratic and Authoritarian Group 
Atmospheres,” University of Iowa Studies in 
Child Welfare, XVI, No. 3 (1940). 

3J. K. Hemphill, “Situational Factors in 
Leadership,” Ohio State University Bureau of 
Educational Research Bulletin, No. 32 (1949). 


4L. D. Zeleny, “Objective Selection of Group 
Leaders,” Sociology and Social Research, XXIV 
(1940), 326-36. 

5L. M. Jack, “An Experimental Study of 
Ascendant Behavior in Pre-School Children,” 
University of Iowa Studies in Child Welfare, 
IX, No. 3 (1934); D. P. Page, “The Modifica- 
tion of Ascendant Behavior in Pre-School Chil- 
dren,” University of Iowa Studies in Child Wel- 
fare, XII, No. 3 (1936). 





tr: 


hese 
no 
lata 
kely 
heo- 
ited. 


tical 
Ml or- 
past 
der- 
hese 
s of 
‘oup 
ship 
and 
s of 
s of 
1 ex- 
s by 
age, 
dis- 
irge- 
and 
ning 
at it 
1eth- 
cipa- 


‘ader- 
ation, 
nkins, 
rticu- 
ycho- 
M. 
with 
Jour. 


f the 
sroup 
es in 


rs in 
au of 
1949)- 
sroup 
XXIV 


ly of 
iren,” 
elfare, 
lifica- 
Chil- 
| Wel- 





EXPERIMENTS IN LEADERSHIP TRAINING 3 


tion and morale.® Discussion and role- 
playing procedures have been used suc- 
cessfully by French to retrain an auto- 
cratic scout leader,’ and by Zander to 
train necessarily dominating leaders of 
naval personnel.® 


It should be emphasized, however, 
that none of these studies predicated 
any hypothesis or explanation of group 
leadership which could be used to evalu- 
ate or prepare leaders for a variety of 
social situations. In view of this, an 
analysis of the conditions of group mem- 
bership and group leadership was at- 
tempted so that an explanation could be 
formulated which would give direction 
and organization to the present investi- 
gation. Stated in its simplest form, this 
hypothesis is that leadership in a given 
setting depends upon the maintenance 
of a unique functional relationship with 
the other members of a group. Unless 
a group can make satisfactory adjust- 
ments to its problems of coordination 
and control, social activity appears to be 
impossible. The specific adjustments re- 
quired by a particular type of social 
situation will depend, of course, upon 
the nature of the problem and the tech- 
niques of adjustment common to the 
group. In developing this hypothesis, 
a number of assumptions about group 
behavior were adopted which constitut- 
ed the experimental rationale of this 
study. 

1. Individual activity is directed at the satis- 
faction of personal needs and wants, most 
of which can be fulfilled only through co- 
operative effort with others. 

2. Association with maintained as 
long as adequate progress toward goals is 
experienced. 


others is 


6A. Bavelas, “Morale and the Training of 
Leaders,” in G. Watson, Civilian Morale (Bos- 
ton, 1942). 

7J. R. P. French, Jr., “Retraining an Auto- 
cratic Leader,” Jour. of Abnormal and Social 
Psychology, XXXIX (1944), 224-37. 

8A. Zander, “Role Playing: A Technique for 
Training the Necessarily Dominating Leader,” 
Sociatry, I (1947), 225-35. 


3. Cooperative activity requires the perform- 
ance of certain general coordinating func- 
tions inherent in all social activity, and 
of specific functions arising out of the 
uniqueness of the situation. 

4. Leadership consists of the performance of 

those coordinating functions which are in- 

dispensable to continuation of group ac- 
tivity. 

Group members adopt techniques of ad- 

justment they perceive as most effective in 

serving their needs. 

6. Repeated success in coping with coordinat- 
ing responsibilities leads to the identifica- 
tion of certain group as 
“leaders.” 


members of a 


The theory that leadership represents 
a unique behavioral relationship with 
the members of a group is a provocative 
experimental hypothesis. If correct, a 
“job analysis” should reveal the specific 
leadership needs for any type of social 
activity. Discovery of these functions 
should make it possible to evaluate 
various techniques of leadership and to 
provide training designed to raise the 
proficiency with which mediocre leaders 
perform these functions. 


II. ISOLATION OF LEADERSHIP FUNCTIONS 


The first experimental phase of the 
investigaton was concerned with identi- 
fying the specific functional require- 
ments of leadership in decision-making 


discussion groups.° A questionnaire 


study and an observational study were 


conducted to obtain data relevant to 


this purpose. 


A. Procedure. 


In the questionnaire study, twenty- 
four prominent writers, teachers, and re- 


9It was necessary to confine the experi- 
mental phases of this investigation to the de- 
cision-making discussion group. It should be 
remembered that discussion groups of this 
kind operate under external pressures which 
seriously affect the character of leadership and 
participation. The necessity for producing a 
negotiable decision within prescribed time 
limits makes some departures from accepted 
discussion procedures necessary. 
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search workers in the field of discussion 


were asked to recall an outstanding 
leader of a decision-making discussion 
group whom they had observed, and to 
list the specific things which he or she 
did which they believed contributed to 
the successful operation of the group. 
The observational study was designed 


to determine the functions of leader- 
ship as they were observed in the actual 
operations of face-to-face decision-mak- 
ing groups. The subjects used in this 
study were Freshmen students enrolled 
in classes in Group Discussion during 
the fall quarter of 1950 at the School of 
Speech, Northwestern University. “The 
forty-one students in these classes were 
different decision- 


assigned to eight 


making groups. No leaders were ap- 


The 


“leaderless” or “stress” group technique 


pointed for these committees. 
was used because of the intrinsic experi- 
mental advantages of this method: (1) 


Lack ol 
group 


leader forces 
make all of 


own procedural decisions. (2) 


an appointed 
members to their 
Selection 
of a problem within the range of experi- 
ence of all group members, in the ab- 
results in 
(3) Obser- 
vational results will not be biased by 


sence of an official leader, 


competition for leadership. 


the presence of an appointed leader ff A 
problem was then chosen (“What should 
be the regulations governing attendance 
at Speech Assemblies?”) which would 
insure considerable ego-involvement and 
on which the degree of information 
would be approximately equal for all 
Each 


structed to submit a set of written rec- 


participants. committee was in- 
ommendations on this problem for pres- 
entation to the Faculty of the School 
of Speech. ‘The recommendations of 
each group were collected at the end of 
the fifty-minute period. 


Students enrolled in courses in Group 
Leadership and Group Dynamics at the 


School of Speech, Northwestern Univer- 
sity, were used as observers in this ex- 
periment. It was felt that students in 
these courses were especially qualified 
for observational they are sensi- 
tive to interpersonal attitudes in groups, 
they are 


work: 


familiar with the 


process, and they are alert to tec hniques 


discussion 


of leadership. In addition to their gener- 
al background, the observers were given 
three days of intensive preparation for 
their Dur- 
ing this time the experimenter discussed 
with them the nature of the experiment 
and the necessity for 


observational assignments. 


careful observa- 


tions. The observers were instructed to 


record verbatim all 


interactions which 


constituted acts olf leadership. Leader- 
ship interactions were defined for them 


as “those which 


interactions were at- 
tempts to influence the group in ways 
that were necessary to achieving the 
group goals, and which were acted upon 
in a positive manner by the members of 
the group.” Identification of leadership 
contributions was to be based solely on 
the criteria contained in this definition. 
A method of recording was then de- 
scribed, and several practice sessions 
were held to develop skill in recording 
leadership interactions: numbers were 
used to identify group members, the 


direction of interaction indicated 


by a dash separating initiator and re- 


was 


spondent, and the content of the inter- 
action was to be recorded verbatim (2-5: 
“Does everyone agree on that point?’). 
Practice 


sessions furnished an 


Oppor- 
tunity to work out interpretive or re- 
cording difficulties. ‘Two observers, one 
from the class in Group Leadership and 
one from Group Dynamics, were then 
assigned to each of the eight commit- 
tees. After the discussions, each observer 
prepared a written description of the 


meeting which could be used later in 
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resolving differences in interpretation 
which might arise. 


B. Results. 


Replies to the questionnaire on lead- 


ership behavior were received from 


eighteen of the twenty-four experts who 
Items of behavior re- 


were contacted. 


ported by the experts were recorded 
on a master sheet as they were received. 
The master list was then analyzed to 
discover the most 


practical way of 


grouping these items. A classification 
system was finally adopted which did 
not seem to violate the intent of the 
items reported, and specific units of be- 
havior were listed under the most ap- 
propriate class of leadership behavior. 
been summarized in 


The results have 


lable I. 


TABLE I 


RESPONSES OF EXPERTS TO QUESTIONNAIRE 
ON LEADER- BEHAVIOR. 








Classification of Frequency of 


Leader-Behavior Identification 
Initiating Group Action 13 
\rranging for Mechanics of 

Operation 12 
Climate- Making 20 
Regulating Participation 13 
Stimulating Group Thinking 8 
Guiding Group Thinking 24 
Information-Seeking and 

Evaluating 7 
Clarifying and Resolving 

Group Conflicts 14 
Summarizing Group Thinking 16 
Negative Behavior* 8 
Total 135 


*Several respondents mentioned behavior in 
which the leader did not engage which they 
felt contributed to the successful operation of 
the group. 


Ten different types of behavior were 
derived which appeared to be character- 
istic of the successful leadership of dis- 
cussion groups. Since the groups which 
had been observed probably varied con- 
siderably in size, membership, and other 
situational factors, too much importance 


should not be attached to the frequency 
with which any particular type of activi- 
ty was reported. All of the functions 
were noted with sufficient frequency for 
them to be considered characteristic of 
successful leadership under these gen- 
eral conditions. 

Each of the two observers in the eight 


student committees recorded all inter- 


actions which appeared to them to con- 
stitute acts of leadership. A combined 


account for each committee was then 
drawn up by combining the entries of 
both observers. An analysis was made 
of this report, and each item of behavior 
was Classified according to the function 
it served at that point in the discussion. 
This classification was reviewed with the 
errors of inter- 


observers to avoid any 


pretation. The complete record for 


the eight summarized in 
Table II. 
The 


tions for the eight committees ranged 


groups is 


number of leadership interac- 
from forty-five to sixty-four for the fifty- 
minute sessions. Nearly all of the types 
of behavior which were identified by the 
observers were common to most commit- 
tees with the exception of “Regulating 
and 
“Introducing of Group Members.” The 


Participation,” ‘“‘Climate-Making,” 


absence of these activities in several 


groups may be traceable to the size or 
membership of the groups, or the train- 
ing these students had already been 
given in discussion, or it may indicate 
the optional nature of these functions 
in some social settings. 

Leadership patterns for these groups 
ranged from those in which a single in- 
dividual was responsible for nearly all 
leadership to those in which there was 
a complete diffusion of leadership re- 
sponsibilities. Twenty-one different 
types of activity were reported for the 
decision-making committees. No judg- 
ment of the relative importance of these 
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TABLE II 








Type of Interaction 


Taking Initiative in 
Discovering Problem 

Providing Clarification 
of Problem 


Initiating Group 
Thinking 


Suggesting Agenda or 
Method of Proceeding 


Securing Selection of 
Leader or Secretary 


Introducing Group 
Members 


Insuring Recording 
of Decisions 


Checking Time 
Limits 


Climate- 
Making 


Regulating 
Participation 


Stimulating Further 
Exploration of Ideas 


Directing Group to 
New or Former Issue 


Delaying Consideration 
* to Later Time 


Noting Digressions 
from Agenda 


Restating Problem 
or Re-Orienting 


Noting Need for 
Further Information 


Clarifying Agreements 
or Disagreements 


Resolving 
Differences 


Summarizing or Asking 
for Summary 


Ascertaining Group 
Opinion 


Terminating Group 
Activity 


Totals 


ho 


bo 


no 


wr 


S$ 


bo 





Committee 
a 5 6 
1 1 1 
2 1 1 
1 1 1 
2 o 2 
1 1 1 
oOo Oo 1 
2 2 9 
8 1 5 
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5 8 4 
A 8 m 
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2 1 A 
1 oO { 
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3 1 2 
7 : 5 
5 6 8 
2 6 1 
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functions is warranted on the basis of 
this experiment, but it is interesting to 
note the four behaviors which were re- 
ported most frequently: (1) “Summar- 
izing or Asking for Summary,” (2) “Di- 
recting Group to New or Former Is- 
sue,” (3) “Ascertaining Group Opinion,” 
(4) “Resolving Differences.” 

When the results obtained from the 
questionnaire and observational study 
are combined it will be apparent that 
one classification system is sufficient for 
summarizing both sets of data. When 
this is done, the following can be de- 
rived as some of the essential functions 
of leadership in decision-making discus- 
sion groups: (1) Initiating Group Ac- 
tion, (2) Arranging for Mechanics of 
(3) Climate-Making, (4) 
Regulating Participation, (5) Stimulat- 
ing Group Thinking, (6) Guiding 
Group Thinking, (7) Information-Seek- 
ing and Evaluating, (8) Clarifying and 
Resolving Group Conflicts, (g) Summar- 
izing Group Thinking. 

Despite the limitations of method in- 
herent in the interpretation of a ques- 
tionnaire, and the difficulties to be over- 
come in identifying and recording group 
interactions, it was felt that human and 
methodological sources of error’ were 
controlled within the limits demanded 
by the experimental purpose. No claim 
is made that all the functions of leader- 
ship have been isolated or even that 
the relative importance of these func- 
tions is known; only that some of the 
essential leadership requirements of de- 


Operation, 


cision-making groups have been de- 
termined. 
III. ‘TRAINING IN LEADERSHIP FUNCTIONS 


If the leadership requirements of de- 


cision-making discussion groups are 
known then it should be possible to 
evaluate techniques of leadership and 


to improve the skill with which average 


or inferior leaders fulfill these require- 
ments. The object of the final phase of 
this investigation, therefore, was to de- 
termine the effects of training on the 
performance of leadership functions as 
shown by the ratings of trained observ- 
ers. To accomplish this purpose it was 
necessary to select suitable subjects, to 
construct and evaluate a rating device 
for measuring leadership ability, and to 
develop training materials and 
cedures. 


pro- 


A. Procedure. 


Three experiments were conducted so 
that sufficient data could be collected to 
evaluate the effects of training. 


1. Selection of Subjects. The sub- 
jects used in this study were Freshman 
and Sophomore students enrolled in 
classes in Group Discussion during 1950- 
1951 at the School of Speech, Northwes- 
tern University. To secure an estimate 
of the relative leadership standing of 
the members of these classes, the group 
nominating technique was used. After 
four or five weeks had passed and each 
student had observed his classmates as 
leaders, “Nominating Ballots” were dis- 
tributed. Classes were informed that in 
a short time they would be divided up 
into small committees to discuss and 
reach a decision upon a difficult problem 
affecting their future activity in the 
course. Each student was instructed to 
nominate two members of the class who 
would, in his judgment, make the 
“most effective’ and the two who would 
make the “least effective’ leaders his 
group could have for working on this 
problem. Nominations for the “most 
effective” leader were given a value of 
+1, and nominations for “least effective” 
leader were scored a —1. The sum of 
these two scores was used to determine 
the leadership rank of each member of 
the class. 
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Selection of the experimental and con- 
trol subjects was made on the basis of 
the results obtained from the ‘“Nominat- 
ing Ballots.” Since the experimental 
design required that no more than six 
persons be trained in any one experi- 
ment, the following criteria were used 
to choose the specific subjects for these 
experiments: (1) Students were chosen 
who ranked as the “least effective” lead- 
ers of their classes. This was done so 
that the effects of training on “crucial” 
cases could be measured. (2) Experi- 
mental and control subjects were equat- 
ed on the basis of their leadership rank 
in their respective sections. By equating 
students, a comparison could be made 
between the effects of the training pro- 
traditional class methods. 


cedures. and 


(3) ‘The recommendations of the in- 
structors were used to insure the selec- 
tion of conscientious students who could 
be depended upon to attend the train- 
ing class meetings and the regular class 
This avoid 

the the 
prolonged absence of any subject. (4) 


sessions. was necessary to 


distorting results because of 
An approximately equal number of male 
and female subjects were chosen to take 
into account possible sex differences. In 
all, sixteen experimental and ten control 
subjects were selected for the three ex- 
periments. 


2. Setting of the Problem. The re- 
maining members of the classes in Group 
Discussion were organized into decision- 
making discussion groups on the basis 
of the leadership ratings of the members 
of the sections. Each committee was as- 
signed students from the upper, middle, 
and lower third of the class according 
to the returns from the “Nominating 
Ballots.” 


ized so that the leadership ability of the 


‘These committees were organ- 


experimental and control subjects could 
be demonstrated before and after train- 


ing. Students were assigned to the same 


committees for both the Pre-Tests (com- 
mittee meetings prior to the training 
period) and Post-Tests (committee meet- 
ings following the training period) so 
that interpersonal relationships and the 
general level of ability would be kept 
as constant as possible. An experimental 
or control subject was then appointed 


to act as chairman for each of these 
committees. 
Problems were formulated for these 


groups which were felt to be of sufficient 
difficulty and importance to create genu- 
ine leadership problems, and each com- 
mittee was required to submit detailed 
recommendations on these problems 
within the fifty-minute period. (Pre-Test: 
“How should grades be determined in 
Post-Test: “How 


the final examination be handled in this 


this course?” should 
course?’’) 

Observers were assigned to each of 
the committee meetings for all three 


experiments. ‘hey were instructed to 
evaluate the leadership provided by the 
chairman, using the Barnlund-Haiman 
Leadet Rating Scale. (See pages 10-11.) 

After the leadership ability of the sub- 
jects had been appraised by the observ- 
ers, the experimental subjects were given 
a two-week Leadership Training Course 
while the control subjects were returned 
to their regular classes. At the end of the 
two-week period Post-Test meetings 
were conducted to re-evaluate the leader- 
ship ability of both experimental and 
control subjects. Chairmen were re-as- 
signed to committees composed of the 
same student members and were rated 
again by the same observers as in the 
Pre-Tests. Ratings given by the observ- 
ers were then treated statistically to de- 
termine the comparative effects of the 
training and control procedures on 
leadership ability. During the following 
school quarter Evaluation Sheets were 
mailed to all experimental subjects re- 
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questing them to appraise the value of 
the Leadership Training Course. 

9. The Leader Rating Scale. In many 
areas of psychological research sufhcient- 
ly sensitive and dependable scales have 
been developed so that the experimenter 
need only select the proper instrument 
the 
for students of leadership. No satisfactory 


for his purpose. ‘This is not case 
scales have been perfected for appraising 
proficiency in leadership. J his may be 
due to lack of appropriate theoretical 
assumptions or because of the inherent 
difficulty in measuring anything as com- 
plex as leadership: in either case these 
obstacles had to be overcome to accom- 
plish the central purpose of this study. 

In the preparation of the Leader Rat- 
ing Scale, the following assumptions 
were made: (1) Leadership is the behav- 
ioral response of an individual to cer- 
tain function needs of groups. ‘Trained 
to 
(2) 
cratic” or “group-centered” leadership is 
the 


observers should be able indentify 


these acts of leadership. “Demo- 


most desirable form of direction. 


Deviations from democratic leadership 
are justified only when situational vari- 
ables make them necessary. (3) Evalua- 
tion of leadership must always be made 
in terms of a specific setting. A scale 
which is not situationally adaptive is 
not realistic. (4) Skill in leadership re- 
quires both sensitivity to group needs 
and skill in making necessary adjust- 
ments. An adequate scale should dis- 


tinguisi the former from the latter. 
ihe Leader Rating Scale which was 
finally developed took situational vari- 
ables into account, provided dual scales 
lor appraising the sensitivity and adjus- 
tive ability of the leader, and contained 
sections corresponding to the functions 
of leadership discovered in Experiment 
I. I'wo scales, one for rating sensitivity 
to group needs and the other for rating 
idaptive ability, were prepared on the 


following functions: (1) Initiating Dis- 
(2) Climate-Making, (3) Regu- 
(4) 
(5) Di- 


recting Group Thinking, (6) Summar- 


cussion, 


lating Amount of Participation, 


Stimulating Group Thinking, 


izing, (7) Resolving Conflict. An eighth 
“Overall 
was added to provide a balanced judg- 


section, Leadership Ability,” 
ment of the total effect of the leader’s 
behavior. (See Leader Rating Scale.)'® 

Since the validity of a measuring in- 
strument is largely dependent upon the 
source of its items, considerable confi- 
dence seems warranted in a scale based 
on experimental observation. That the 
scale actually measures leadership abili- 
ty is borne out by other external checks: 
observer reports describing the leader's 
activity showed consistent agreement 
with the ratings assigned, and the low 
scores received by “ineffective” leaders 
in the Pre-Tests are another indication 
that the scale accurately reflects leader- 
ship ability. No validity check could be 
made against other measuring instru- 
ments since none exists. 

Reliability of the Leader Rating Scale 
depends, of course, upon the consistency 
with which observers rated leaders 
when using this scale. Differences in rat- 
ings must be due to variability in sub- 
jects, not to variability in observations. 
‘The observers used in the training exper- 
iments were again drawn from classes in 
Group Leadership and Group Dynamics 
at Northwestern University. Several davs 
of training were spent clarifying the as- 
sumptions of the scale, defining the lead- 
ership functions specifically, and using 
the scale in concrete cases. ‘Two observ- 
ers were then assigned to each student 
both the Pre-Test and 


committee for 


Post-Test. (Observers had no way of 


10Instructions for administering the scale 
will be found in “Appendix A: The Barnlund- 
Haiman Leader Rating Scale” in F. S. Haiman, 
Group Leadership and Democratic Action (New 


York, 1951) 
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Group needed more 
help from leader in 


getting started. 
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BARNLUND-HAIMAN LEADER RATING SCALE 


Copyright 1951 


INITIATING DISCUSSION 
1 Oo ! 2 
- Group got right 
amount of help in 
getting started. 


The quality of the leader's introductory remarks was: 


| 


Exceflent 


ss 9 


Group needed more 
help in securing a frank, 
permissive atmosphere. 


= | cee teal 


Good — Adequate ~~ Fair 


CLIMATE-MAKING 
1 oO 1 2 
Group got right 
amount of help in 
securing a frank, 
permissive atmosphere. 


frank, permissive 
atmosphere. 


Group needed less 
help in securing a 


3 


- Group needed less 
help in getting started. 


~ Poor 


With regard to the establishment and maintenance of a frank and permissive atmosphere, 
the leader’s behavior was: 


| 


Excellent 


3 2 


Group needed more 
regulating of par- 


ticipation 
If and When the 
| 


Excellent 


2 — 
Group needed 
more stimulation 
If and When the 
| 


Excellent 


3 2 
Group needed 
more direction 
in thinking 

If and When the 
| 


' - 
Excellent 


3 2 
Group needed 

more summaries 
If and When the 


| 


Excellent 


Good Adequate Fair 
REGULATING AMOUNT OF PARTICIPATION 
1 oO 1 2 


Group got right 
amount of regulation 
of participation ticipation 


leader attempted to regulate participation, his method was: 
| | | 


Good Adequate Fair 


STIMULATING GROUP THINKING 
1 Oo ! 2 
Group got right 
amount of stimulation stimulation 
leader attempted to stimulate group thinking his method was: 


! 


Good Fair 


Adequate 
DIRECTING GROUP THINKING 


1 Oo 1 2 
Group got right 
amount of direction 


in thinking 


direction in 

thinking 

leader attempted to direct group thinking, his method was: 

| | | 

Good Adequate Fair 
SUMMARIZING 

i o 1 2 


Group got right 
amount of summaries 


Group needed 
fewer summaries 


leader provided summaries, his method was: 
' 1 | 
Fair 


Good Adequate 


Group needed less 
regulation of par- 


Group needed less 


- Poor 


Poor 


Poor 


3 


Group needed less 


Poor 


Le 


- Peor 








ere, 


Poor 


Poor 


Poor 


Poor 


ww 


Peor 


A 3 a 2 1 
Group needed more 
help in resolving 
its conflicts 
conflicts 


Excellent 


Group needed 
more control 


was: 


-ncellen Good 
identifying the experimental or control 
leaders.) 

The reliability of the scale was de- 
termined from the scores of the pairs of 
observers who rated the experimental 
and control leaders. Scatter diagrams 
were prepared, and Pearson product- 
moment reliability coefficients were ob- 
tained for Parts A and B of the Leader 
Rating Scale on Pre-Tests and Post- 
Tests as well as for the scale as a whole. 
The overall reliability of Part A was 
found to be .63 (P.E. = .037) and of Part 
B was .58 (P.E. = .o40). In view of the 
comple < nature of leadership and the 
homogeneity of the sampled population, 
these coefficients justify 
confidence in the scale. 

4. Training Materials and Proced- 
ures. Performance of a particular skill 
can be improved only if proper objec- 
tives and sound methods of training are 
adopted. Improvement in the leader- 
ship ability of the experimental subjects 


considerable 











seemed to require an increased under- 
| standing of the nature of group leader- 
| ship, the adoption of more constructive 
| attitudes toward leadership responsibili- 
| ties, and increased sensitivity to, and 
competence in performing, the functions 
of leadership in decision-making groups. 


It was decided at the outset that to 





- Group got right 
amount of help in 
resolving its 


Group got right 
amount of control 


Adequate 
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RESOLVING CONFLICT 


o 1 2 3 
Group needed less 
help in resolving 

its conflicts 


B. If and When the leader attempted to resolve conflict, his method was: 


Fair 


Good \dequate ~ Poor 
OVERALL LEADERSHIP ABILITY 
A. 3 2 1 o 1 2 8 


Group needed less 
control 


B. If and When the leader attempted to control the group, the general quality of his leadership 


Fair 


study all of the functions within a two- 
week training period was impossible. 
Instead a training manual was prepared 
on the following topics only: (1) “Lead- 
ership and Discussion,” (2) “Initiating 
Group Action,” (3) “Directing Group 
Thinking,” (4) “Summarizing Group 
Thinking,” (5) “Resolving Group Con- 
flict.” Other problems of leadership 
were studied as they occurred during 
the course of training. The purpose of 
the manual was to set forth, as briefly as 
possible, some of the leadership needs 
of decision-making groups, to clarify 
the nature of discussion leadership, and 
to describe principles and methods for 
coping with leadership problems. 

A variety of methods was used to 
accomplish the objectives of the course: 
the training manual was read and re- 
viewed to provide a theoretical frame- 
work for training; diagnostic discussions 
were conducted to explore the operation 
of these principles in concrete cases; and 
a high degree of group self-direction 
was cultivated to encourage the adop- 
tion of optimum attitudes toward group 
participation. The most important 
training innovation, however, was the 
use of role-playing or reality-practice as 
a training method. The role-playing 
technique involves the setting-up of an 





artificial situation in which students 
act out a behavior problem which they 
wish to explore. Each person adopts a 


particular role and reacts spontaneously 


to the problem. (In most cases ‘“‘con- 
crete” roles are used in which each per- 
son plays a specific individual. In the 


Leadership Training Course, “abstract” 
roles were used which involved unique 
types of After 
dramatizations, participants and observ- 


behavior instead.) the 
ers analyze the action. ‘The purpose ol 
this diagnosis is to evaluate the adap- 
Fol- 


lowing the appraisal, the scene may be 


tive behavior of the participants. 


re-enacted or a new situation created 
for perfecting the necessary skills. The 
process then continues through enact- 
ment, diagnosis, evaluation, and re-en- 
actment. The major advantages of this 
method are that the student is encour- 
aged to experiment with new ways ol 
behaving under less than real-life condi- 
tions, and that confidence can be gained 
by perfecting skills prior to their actual 
use in future situations. 


B. Results. 


1. Quantitative. Vhe scores of the 
experimental and control subjects for 
the three experiments were grouped to- 
gether, and statistical tests of significance 
were computed for the differences be- 
tween the gains made by the experi- 
mental and control groups. Ratings on 
Part A and Part B of each section of the 
Leader Rating Scale were compared. In 
this way, changes in sensitivity to group 
needs could be differentiated from 
changes in the quality of leadership pro- 
vided. 

The t-test for the significance of the 
difference between the means of indepen- 
dent small samples was used to compare 
the gains made by experimental and 
The 


each 


control subjects. value of tf 


Parts A and B of section of 
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for 
the 


Leader Rating Scale will be found in 
Table IIL. 


rABLE III 
COMPUTED VALUES OF ¢t FOR THE GAINS 
EXPERIMENTAL AND CONTROL GROUPS ON 
BARNLUND-HAIMAN LEADER RATING SCALI 


OF 
THE 








Functions of Leadership t-values 


Initiating Discussion 
Part A 
Part B 
Climate-Making 
Part A 
Part B ? 
Regulating Amount of Participation 
Part A 
Part B 
Stimulating Group 
Part A —.44 
Part B 1.75 


aa 


-78 
3.65* 


2.00 
2.62* 


Thinking 


Directing Group 
Part A 
Part B 
Summarizing 
Part A —.57 
Part B 0.00 
Resolving Conflict 
Part A 2.15% 
Part B 
Overall Leadership Ability 
Part A — 
Part B 
Total Gain 
Part A 58 
Part B 


Chinking 
81 
O4 








*Significant at the 5°% level. 
**Significant at the 1% level. 
A of “Re 
Part B ol 


“Regulating 


‘The t-values obtain on Part 


solving Conflict,” and on 


“Initiating Discussion,” 
Amount of Participation,” and “Resolv- 
ing Conflict,” were found to be statisti- 


cally significant. The (t-values obtained 


on Part A of “Regulating Amount of 
Participation,” and on Part B of “Cli- 
mate-Making,” “Stimulating Group 


Thinking,” 
ty,” 


“Overall Leadership Abili 
and “Total Gain,” were positive 
and large, though not statistically signil- 
icant. 

The values of t for “Initiating Discus- 
sion,” “Regulating Amount of Partici- 
pation,” and “Resolving Conflict,” are, 
therefore, sufficiently large to cause re- 
jection of the null hypothesis. The like- 
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that this difference is due to 


chance is very remote; and it is safe to 


lihood 


conclude that the experimental subjects 
gained significantly over the controls in 
these respects. The values of ¢ which 
were positive and large, but not statis- 
tically significant, suggest that there may 
be a difference between experimental 
and control subjects on sensitivity to 


group needs for “Regulating Amount 
of. Participation,” and on skill in “Cli- 
mate-Making,” and “Stimulating Group 
Thinking,” as well as on “Overall Lead- 
On 


the basis of this evidence it is not safe 


ership Ability,” and ‘““Votal Gain.” 


to conclude, however, that these difter- 
Addi- 
tional research, employing larger sam- 


ences did not result from chance. 


ples and tighter experimental controls, 
is needed to check these results. 
2. Qualitative Results. <A 


training which aims at the alteration of 


course ol 


social attitudes and the improvement of 
skills 


quately by quantitative methods alone. 


social cannot be appraised ade- 
\lthough students’ evaluations may not 
be statistically reliable, they are, never- 
theless, important. 

Replies to the Evaluation Sheet which 
was mailed to the experimental subjects 
after they had completed the Leadership 
[raining Course provide some revealing 
comments on the training. Experimen- 
tal subjects agreed unanimously that the 
course had been of value and many sug- 
gested its expansion and inclusion in 
the curricular program. ‘They reported 
that the training had increased their sen- 
sitivity to leadership problems and to 
social behavior in general, improved 
their skill in coping with some of the 
responsibilities of leadership, and_pro- 
duced more objective attitudes toward 
their own behavior and the behavior of 
those about them. In addition, several 
students claimed that the training had 


helped them greatly in meeting prob- 
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lems of a similar nature in their frater- 
nities, dormitories, and student govern- 
ment. ‘he only suggestion for improve- 
ment was that the training be extended 
for a longer period of time so that great- 
er skill could be acquired. 

C. Sources of Error. 

The most important source of error 
in appraising the effects of leadership 
training appears in the evaluation pro- 
cedure. Statistical methods are highly 
desirable ways of summarizing experi- 
mental findings, yet the suitability of 
tests of significance depends upon the 
degree to which the statistical assump- 
The 


t-test assumes a sizable and normally 


tions of these formulae can be met. 


distributed sample, and the fact that this 
test takes small sample errors into ac- 
count does not completely protect 
against such errors. ‘The qualitative re- 
ports of the subjects, observers, and in- 
structors do, however, substantiate most 
of the large and positive statistical re- 


sults. 


IV. CONCLUSIONS 


The 


decision-making 


investigation of leadership in 


discussion groups re- 
quired the formulation of a satisfactory 
theoretical framework, the discovery of 
the functional requirements of leader- 
ship in decision-making groups, and the 
measurement of the effects of training 
on skill in leadership. 

A review of the historical lines of in- 
quiry and an examination of the nature 
of group relationships led to the hy- 
pothesis that leadership represents a 
unique functional relationship existing 
among the members of a group—the 
specific nature of this functional rela- 
tionship depending upon the character 
of the problem before the group and 
the techniques of adjustment common 
to that group. 


Data collected by questionnaire and 
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observation indicates that the function- 
al requirements of leadership in de- 
cision-making groups can be isolated 
and described. They include such activi- 
ties as initiating group action, arranging 
for the mechanics of operation, climate- 
making, regulating participation, stimu- 
lating group thinking, guiding group 
thinking, information-seeking and eval- 
uating, clarifying and resolving group 
conflicts and summarizing group think- 
ing. The skill with which these functions 
are performed is likely to influence se- 
riously the quality of decisions arrived 
at by a group and the level of satisfac- 
tion experienced by the members. 

On the basis of an experimental study 
of the effects of training on the leader- 
ship ability of ineffective leaders, it is 
possible to conclude that: 

(1) Procedures can be devised for se- 
lecting, evaluating, and training group 
leaders. 


(2) Leadership training, based on the 
assumptions and procedures of this 
study, does improve the ability of ineffec- 
tive student leaders to perform some of 
the functions of leadership in decision- 
making groups. 

The results of this exploratory study 
have certain educational and social im- 


plications. If the operation of essential ' 


social processes can be improved by 
providing more effective leadership, 
then our educational system should se- 
riously consider the value of providing 
further opportunity for 
group leadership. In the field of speech 


training in 


and communications, where much of the | 


training actually involves preparation 
for leadership, more attention to the re- 
quirements and techniques of leadership 
seems warranted. Certainly, training for 
more effective democratic leadership 
needs no rationalization in our contem- 
porary society. 
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A CONSIDERATION OF THE RHETORICAL CAUSES 


OF BREAKDOWN 


IN DISCUSSION* 


EDWIN BENJAMIN BLACK 
Cornell University 


ROUP discussion, as an object of 

study, presents as varied and com- 
plex a phenomenon as appears on the 
human scene. Scholars from diverse dis- 
ciplines have investigated it with their 
sights on the personalities of the partici- 
pants, 
effects, 


on its educative and persuasive 
and on a number of other aspects 
of the process. Each point of view, as 
it has been brought more sharply into 
focus, has provided an enriched under- 
standing of group discourse. 

This study was motivated by the ques- 
tion of whether there is a body of char- 
acteristics common to discussion and to 
public address; whether, indeed, one 
can intelligently use the language of 
rhetorical theory in talking about group 
Casual at 
justifies the assumption of some relation- 


discourse. reflection, least, 
ship between rhetoric and discussion, 
since an essential feature of each is the 
manipulation of language symbols. A 
long history of practical application fur- 
ther indicates the utility of rhetorical 
theory. These considerations, insofar as 
they are valid, point to the conclusion 
that an investigation of some aspect of 
group discussion conducted from a spe- 
cifically rhetorical point of view should 
prove fruitful. Induced by this expec- 
tation, the primary concern of the pres- 


' ent study was with the rhetorical causes 


Rhetorical 
taken to 
affecting the choice of 


of breakdown in discussion. 


considerations were be those 
the available 
means of persuasion and exposition, and 
the investigation sought to determine 


whether, and in what ways, these may 


*Based upon M.A. thesis, Cornell University. 
1953, directed by Carroll C. Arnold. 


cause or contribute to disruptions in 
group discourse. 

Once an investigator has crystallized 
the questions which he wishes to answer, 
his next problem is the selection of 
relevant and meaningful data for his 
inquiry. The raw materials for this study 
were assembled by auditing some thirty- 
five classroom discussions recorded over 
a period of several years at Cornell Uni- 
versity. All major types of discussion 
were represented in the sampling, in- 
cluding closed-group discussions, public 
discussions, and colloquies. Portions of 
these made dis- 
cernible contribution to group under- 


discussions which no 
standing or to the co-operative deliber- 
ation of a problem were defined as 
breakdowns. These breakdowns were 
isolated and submitted to three judges 
for verification as meeting the terms of 
the definition. Nineteen passages of dis- 
cussion, each containing one or more 
breakdowns, were retained for detailed 
analysis, each having been unanimously 
confirmed by the judges. ‘hese were the 
materials studied for rhetorical factors 
or configurations occurring frequently 
enough to suggest causes of the disrup- 
tions. The examination of these passages 
was analytical, not experimental, since 
its object was to discover hypotheses 
rather than to test them. 

When the portions of unproductive 
discussions were compared, it appeared 
that some key contributions, while con- 
tinuous, fluent, and coherent, seemed to 
contribute nothing to the purposes ot 
the These _ irrelevancies 
were, of course, digressive breakdowns. 
There were other individual contribu- 


discussions. 
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tions which did not initiate a futile line 
of inquiry, but which seemed discon- 
tinuous, which were ignored by the 
group, apparently misunderstood by the 
group, or which the group explicity re- 
fused to consider. These comments were, 
in short, ambiguous. 
There seemed to be distinct differ- 
ences in the rhetorical character of the 
digressive and the ambiguous contribu- 
tions. While there was some overlap- 
ping, most of the breakdowns fitted 
rather clearly into one or the other 
So neat a dichotomy is 
but it 


classification. 
certainly open to suspicion, 
proved to be convenient for this study. 
The implications of this dichotomy’s 
having some basis in reality as well as 
in convenience will be discussed later. 

Beyond suggesting a possible way of 
classifying disruptions, detailed exami- 
nation of the breakdowns gave rise to 
several tentative hypotheses concerning 
the unsuccessful 
communication 
these hypotheses were obvious and are 
familiar to all who study group discus- 


rhetorical causes of 


in discussion. Some of 


sion. That failure to agree upon the 
meaning of a term and that lack of an 
objective referent for a term may lead 
to confusion in discussions are typical 
of the commonly accepted and familiar 
hypotheses supported by this research. 
However, three other hypotheses con- 
cerning causes of breakdown in discus- 
sion, perhaps less familiar to students 
of the process, also emerged; it is to 
these that the remainder of this essay 
will be devoted. 

When the digressive breakdowns were 
examined, they were found, without ex- 
ception, to be preceded by contribu- 
tions which ended on a level of com- 
munication which was less abstract than 
the level on which the discussion had 
been proceeding. Hence, this hypothesis 
was suggested: An individual contribu- 


tion which ends on a level of communi- 
the level on 
which the discussion had been proceed- 


cation less abstract than 
ing is likely to cause a breakdown. 
What this hypothesis says, in effect, is: 
from the 
general to the specific without returning 


when a discussant descends 
to the general, a digression is likely to 
result. The members of the group be- 
come involved in particulars, frequent- 
ly allowing minutes to obscure the ulti- 
mate aims of the discussion. 

The configuration described by this 
hypothesis was present in all cases of 
digression studied. Assuredly, this is not 
to say that every time such a configura- 
tion occurs, a digression will inevitably 
follow; but, the fact that each digression 
was preceded by verbal communication 
which descended from the abstract to 
the specific without returning to the 
abstract strongly suggests that here may 
be a contributing cause of digressive dis- 
ruptions. 

An especially interesting pattern also 
recurred among the digressive break- 
downs. Often the absence of a suitable 
general term by which to denote, ab- 
stractly, a complicated concept or process 
bogged 
down in a mass of inutile details. Per- 


forced discussants to become 
haps the following example from one 
of the discussions will serve to clarify 
the difficulty under consideration here. 

The discussants were trying to find 
the best means of correcting inflation. 
They were concerned with the economic 
system: a dynamic process. The discus- 
sion flowed from a consideration of dol- 
lar value to government spending, to 
credit, to consumer. spending, to taxes, 
with few questions being resolved. The 
discussants were virtually drowning in 
detail because, other 


a sea of among 


1 Those familiar with the Process of Ab- 
straction described by general semanticists 
will recognize that I have been influenced in 
phrasing by this concept. 
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reasons, it is the peculiar nature of an 
economic process that no single element 
in it can be entirely understood apart 
from its function in the process as a 
whole. The alteration of any of the var- 
iables affects the process and all other 
variables within the process. The dis- 
cussants could not surmount this diffi- 
culty. They had no terms with which to 
denote the entire process on which the 
discussion was centered. Hence, while 
the discussants seemed to be aware that 
there is an interaction between dollar 
value and government spending, their 
common vocabulary did not permit them 
to denote this interaction in a way which 
all would understand; they were com- 
pelled to discuss dollar value and gov- 
ernment spending as two disparate vari- 
ables. The result was a preoccupation 
with details to the detriment of the dis- 
cussion. 

This example illustrates one pattern 
of digression. Its repeated appearance 
supported the hypothesis that: The ab- 
sence in a discussion of a commonly ac- 
cepted and sufficiently inclusive term to 


denote a abstract 


relevant process or 
complex ideological structure is likely 


to cause a breakdown. 


If this hypothesis should be true, it 
would have some interesting implica- 
tions for students and teachers of group 
discussion. Like the earlier hypothesis, 
this, too, intimates that verbal concrete- 
ness can, under certain circumstances, be 
just as destructive as excessive abstrac- 
tion, and both suggest important qualifi- 
cations of our injunction against ab- 
straction and in favor of concrete termi- 
nology. Also, they suggest a re-examina- 
tion of traditional modes of attacking 
problems in discussion. 


The presently proposed hypothesis, 
especially, indicates that there must be 
some modification of the customary 


stress on analyzing, before solutions are 


‘ 


considered, as many aspects of a prob- 
lem as come within the awareness of the 
group. Perhaps there is a point beyond 
which problem analysis becomes detri- 
mental to discussion. Even so prosaic a 
question as whether to put on a blue or 
a brown suit in the morning could, if 
one would let it, quite logically lead to 
a consideration of social mores, aesthet- 
ics and epistemology. Indeed, one could 
very conceivably spend the rest of his 
life analyzing the problem without ex- 
hausting the topics relevant to the de- 
cision he must make, and die aged, wise, 
and undressed. ‘Thus, the point at which 
analysis of the problem ceases and 
deliberation of solutions begins will be 
an arbitrary point. The hypothesis sug- 
gests that the extent of problem analysis 
in discussion should be conditioned by 
the quantity of data which the discus- 
sants can manipulate. 

In addition to the suggestions that 


unrecovered descent from abstraction 
and lack of comprehensive terminology 
may cause discussions to falter, the pas- 
sages of hint at a 


third hypothesis, describing a rhetorical 


discussion studied 
cause of ambiguity. 

One way of seeking the source of am- 
biguity or loose thinking in general is 
to look at logic. In an effort to discern 
whether any of the logical constructions 
in discussion contributions caused break- 
downs, an attempt was made to analyze 
the materials of the study by the stand- 
ards of formal, Aristotelian logic. The 
attempt yielded nothing. Not a single 
syllogism was to be found in the pas- 
sages, and none of the cases of induction 
met the ordinary tests of formal validity, 
even when the contribution proved help- 
ful in the discussion. The conclusion, 
then, was that discussion, like public 
speaking, rarely utilizes formal logic. 

A positive finding was that the rhe- 
torical counterparts of induction and de. 


DETROIT PUBLIC LI BRARY 
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duction—example and enthymeme, re- 
spectively—appeared as the basic logical 
elements of the discussions. The sugges- 
tion of a common logical system in pub- 
lic speaking and discussion points to a 
more than adventitious relationship be- 
tween traditional rhetoric and discus- 
sion. 

The very use of enthymemes instead 
of scientific syllogisms introduces the 
hazard of ambiguity. More than this, the 
use of enthymemes in discussions con- 
cerning human affairs appears to intro- 
duce some special problems related to 
the clarification of values. It seemed to 
be characteristic of the discussions stud- 
ied that the major premise of each 
enthymeme stated a value, while the 
minor premise defined a specific instance 
and the conclusion applied the value to 
the specific instance. Rarely were the 
conclusion and both premises explicated; 
usually, they were left for the listeners 
to infer from only one of the premises. 
It would, of course, make no difference 
whether the premises were stated or 
whether one or both were tacitly under- 
stood. The argument would be an en- 
thymeme in either case, since authori- 
ties now seem generally agreed that 
truncation is not the distinctive factor 
differentiating an enthymeme from a 
scientific syllogism.? 

But what occurs when the value prem- 
ise is not perceived or accepted by lis- 
teners? One would suppose that collab- 
orative thinking must break down. This 
is, in fact, precisely what happened in 
some of the discussions audited, and this 
pattern of breakdown occurred frequent- 
ly enough to suggest the following hy- 
pothesis: The failure to explicate the 
major premise of an enthymeme when 

2Edward H. Madden, “The Enthymeme: 
Crossroads of Logic, Rhetoric, and Metaphysics,” 
The Philosophical Review, LXI (1952), 368- 
376; James H. McBurney, “The Place of the 


Enthymeme in Rhetorical Theory,” Speech 
Monographs, Ill (1936), 49-74. 


the premise is not perceived to be a 
value of the discussants is likely to 
cause a breakdown. 


Not only did misunderstanding of 
value premises cause breakdowns, but 
divergence of values also contributed 
to ambiguity, particularly when the di- 
vergence was not recognized by discus- 
sants. ‘The data employed in this study, 
as well as theoretical considerations, 
make it appear that a certain minimum 
of agreement on values is necessary in 
a discussion. Unless discussants have at 
least this minimal consonance of values, 
they are unable even to agree that the 
problem which they are to deliberate 
exists. 

All of these considerations seem to 
mean, for discussion, that the failure to 
explicate the first premise of an enthy- 
meme, either when the value contained 
therein is not recognized or when it is 
not shared by the discussants, can cause 
a breakdown characterized by ambiguity. 

In summary, an attempt was made to 
detect sources of disruption in the rhe- 
torical elements of group discourse. The 
study indicated that breakdowns have 
their sources in, among other things, 
a descent in verbal abstraction without 
a subsequent ascent, the lack of compre- 
hensive terminology to denote a complex 
of ideas or a process, and the failure to 
verbalize value premises which are not 
perceived or shared by the listeners. 
How many of these and other concepts 
of traditional rhetoric and how much of 
its vocabulary are capable of being trans- 
ferred to discussion? The query has 
never been answered, and studies lead- 
ing to its determination might prove 
profitable. the conclusions 
from this investigation cannot be taken 
as final; they are untested and, hence, 
but tentative. However, should experi- 
mentation 


Certainly 


validate these hypotheses, 


we shall have advanced another step 
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toward a rhetorical theory for group 
discussion. 

An intriguing problem 
raised by this research is that of the re- 


especially 


lationship between discussion and rhe- 
torical logic. The data indicated that 
breakdowns following from misused ex- 
amples are digressive, and that break- 
downs following from misused enthy- 
memes are characterized by ambiguity. 
May it not be significant that the two 
types of should 
bear so close a relation to the two types 
of proofs in rhetoric? The Aristotelian 
formula is: “Whenever men in speaking 
effect persuasion through proofs, they 


breakdowns seem to 


do so either with examples or enthy- 
The 
this study is: 


memes; they use nothing else.’’* 
formula suggested by 


3 Lane 
Aristotle 


trans., The 
1932), Pp. 10. 


Cooper, 
(New York, 


Rhetoric of 


Whenever breakdowns occur in discus- 
sion, they occur either as digressions or 
ambiguities, and nothing else. Some of 
the questions raised when these two 
propositions are contemplated side by 
side are: Is there a consistent relation- 
ship obtaining between the forms of 
rhetorical reasoning and the types of 
breakdown in discussion? Do misuse of 
example and misuse of enthymeme pro- 
duce distinctive breakdown? 
Do all breakdowns in group delibera- 


types of 


tion have misuses of rhetorical logic at 
their source? Here, certainly, are prob- 
lems enough for several inquiries con- 
cerned with the relationship between 
rhetoric and discussion—inquiries which 
might, in the deline- 
ation of a systematic rhetoric of discus- 


time, lead to 


sion, a theoretical formulation to guide 
future research. 








SELF-CONCEPTS OF COMMUNICATION SKILL AND 
PERFORMANCE IN SMALL GROUP DISCUSSIONS 


LAURA CROWELL, ALLAN KATCHER, and S. FRANK MIYAMOTO* 
University of Washington 


L. is common observation that people 
make judgments about the way that 
they and others express themselves. Such 
judgments may indeed be related to the 
individual’s behavior in communication 
situations. There is, in fact, a body of 
theory, perhaps best associated with the 
name of G. H. Mead,? which holds that 
the individual’s behavior is affected by 
the kind of image he has of himself, an 
image derived largely from his percep- 
tion of others’ attitudes toward him. 

In this investigation we have attempt- 
ed to examine the individual's self-image 
relative to the two major aspects of com- 
munication—the and the 
reception of information—and to ask 
how the individual sees himself as com- 
municator We 
back- 
ground factors relate to self-concepts 


transmission 


and 
have further 


as communicant. 


asked how certain 
and, in turn, how these self-assessments 
relate to his performance in group dis- 
cussion. 

It may be hypothesized that those 
who feel themselves inadequate com- 
municators will not be likely to express 
themselves effectively; that 
those who view themselves as poor lis- 
teners may be somewhat disoriented with 
respect to the course of thinking in the 
group. Whether or not self-images are 
realistic, it may be expected that they 


similarly, 


*This program of research is being supported 
by a grant from the Agnes Anderson Fund, 
University of Washington. The investigators 
are from the Departments of Speech, Psychology, 
and Sociology, respectively. We gratefully ac- 
knowledge the cooperation of the fall quarter 
(1954) psychology class in Personality, and the 
research assistance of Miss Elizabeth fJallie. 

1 George H. Mead, Mind, Self and Society 
(Chicago, 1937). 


play an important part in determining 
the nature of the individual’s communi- 
cative behavior. Such a conceptualiza- 
tion may have utility in understanding 
the forces in group discussion. 

Although this report concerns a pilot 
investigation on a small sample only, its 
plan is indicative of a larger program 
of research now under way with the co- 
operation of the Department of Air 
Science at the University of Washington. 
In the latter study a sample of 194 col- 
lege men is being utilized for similar 
analyses. In addition, we are exploring 
the relationships between self-concepts 
of communicative skill and leadership 
not only through group evaluations (as 
in the study reported here) but also 
through analyses of participation in ac- 
tivities, grades in the reserve officers’ 
study program, cadet rankings, and lead- 
ership assessments by officers and fellow 
students. Under investigation also are 
relationships between self-concepts and 
measures of personality variables and of 
linguistic skills. Furthermore, a study 1s 
under way in ten Basic Speech Improve- 
ment the effect of 
training on self-concepts of communi- 
cative skill. 


classes to measure 


I. METHOD 


A. The Tests. Two scales were devel- 
oped for the purpose of measuring a 
person’s evaluation of his own ability 
(1) as a speaker and (2) as a listener. 
These are called, respectively, the Com- 
municator and Communicant Scales. 

As an initial step in the construction 
of the communicator scale, a group of 34 
psychology students was asked to list 
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as many statements as possible of the 
kind which they characteristically make 
about their own and other people’s com- 
the several 
submitted, and 
others formulated by the investigators, 
94 were selected and reworded to consti- 


municative ability. From 


hundred statements 


tute a preliminary questionnaire. The 
selection of items was guided by a theo- 
retical conception of the important com- 
ponents involved in communication,’ 
the theory being used mainly to insure 
that every component was well repre- 
sented with items. 

All items were presented in the form 
of the following samples: 

Ch NC DK 6g. I usually feel in- 
hibited when I am expected to 
contribute something to a discus- 
sion. 

Instruction was given to circle the Ch 
if the subject considered the statement 
characteristic of himself, the NC if it 
was considered and 
the DK, “don’t know,” only if the sub- 


ject really could not decide. In scoring 


not characteristic, 


the items, two points were arbitrarily 
assigned to all responses which reflected 
a high self-concept of communicator 
ability, one point to a DK, and zero toa 
low self-concept response. 

The preliminary questionnaire was 
pretested on two different groups of stu- 
dents, totalling 240 persons, and the re- 
sponses from each group were subjected 
to separate item analyses by comparing 
the answers of those in the top quartile 
of total scores against those in the low- 
est. By this procedure, 31 of the least 
discriminating items were eliminated, 
leaving 63 questions in the final scale. 

The analysis of the test-retest reliabili- 
ty of this scale is incomplete and cannot 


2 Eugene L. Hartley and Ruth E. Hartley, 
Fundamentals of Social Psychology (New York, 
1952). Also see Theodore M. Newcomb, “An Ap- 
proach to the Study of Communicative Acts,” 
Psychological Rev., LX (1953), 393-404. 


be reported, but a split-half reliability 
measure computed for the 194 question- 
naires of the Air Force group—from the 
date of our larger study—using 62 of the 
63 items retained in the final scale, gave 
r= .g2 (Spearman-Brown formula). 

Essentially the same procedure was 
followed in the construction of the com- 
municant scale which, in its final form, 
included 75 items. The split-half test on 
the communicant the 37 
subjects in the present study yielded a 
correlation of r = .87. 


responses of 


In addition to the two scales, ques- 
tions were included which inquired into 
the age and sex of subjects, age of sib- 
lings, father’s occupation, and parents’ 
educational level. Two other series of 
questions probed the parental emphases 
on communication—careful pronuncia- 
tion, thinking before speaking, etc.— 
and on the characteristics of the child- 
hood family conversation. 

Since it was a major concern in this 
study to investigate the relationship be- 
tween the two scales and the perform- 
ance of members in discussion groups, a 
12-item questionnaire devised by Blake, 
Mouton, and Fruchter? (see Table III 
for the items included) which seemed 
suitable for getting performance ratings 
on group members, was also employed. 
These authors determined the reliability 
of their ratings by studying three-man 
groups of strangers who interacted in ses- 
sions lasting only fifteen minutes. De- 
spite the brevity of the interactions, 
they report consistently reliable ratings 
on such items the 
group decision, clerical work, leadership, 
and dominance. 


as contribution to 


The judgments were 
least reliable for satisfied with group de- 
cision, frustrated, and good job. A\l- 


3 Robert R. Blake, Jane S. Mouton, and 
Benjamin Fruchter, “The Consistency of In- 
terpersonal Behavior Judgments Made on the 
Basis of Short-Term Interactions in Three-Man 
Groups,” Jour. of Abnormal and Social Psychol- 
ogy, XLIX (1954), 573-578- 
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though our groups, consisting of 12 or 
13 members each, were larger than those 
studied by Blake, Mouton, and Fruchter, 
our subjects interacted a considerably 
greater length of time. There seems 
every reason to believe, therefore, that 
our ratings are at least as reliable as 
those reported by Blake, Mouton, and 
Fruchter. 

Finally, every member submitted an 
introspective report of his experience in 
his group. These proved a helpful source 
of suggestions regarding some factors 
which may have influenced the ratings 
received by the various members. 


B. The Subjects. The subjects in the 
present investigation were 37 students, 27 
males and 10 females, enrolled in a psy- 
chology of personality course at the 
University of Washington. Members of 
the class were divided into three discus- 
sion groups, each of which had the as- 
signed task of considering the theoretical 
views about personality developed by 
Freud, Lewin, or Dollard and Miller. 
Subjects had free choice of groups but 
were encouraged to join the one study- 
ing the theorist least familiar to them. 
This resulted in a high concentration of 
psychology majors in the Lewin group. 

Each group was responsible for con- 
ducting a series of intensive discussions 
on the major concepts and research in 
their respective systems, and for prepar- 
ing a group paper that carefully sum- 
marized the point of view. In addition, 
the groups were required as a final task 
to present their findings before the en- 
tire class. Exclusive of the regular class 
meetings, it is estimated that the amount 
of time spent by the members in group 
participation ranged between 20 and 30 
hours. The instructor was available for 
and if needed, but 
his direction over the groups was gener- 
ally limited to a few initial suggestions 
that each group designate a chairman 


consultation when 


and recorder, rotate these functions dur- 
ing the series of meetings, and have 
everyone participate as actively as possi- 
ble. To provide incentive, the students 
were told that their work in the groups 
would have one-quarter weight in their 
final grades. 

While the sampling deficiencies here 
are quite apparent, the groups appeared 
sufficiently heterogeneous and random 
to serve for a pilot study. 

C. Administration of Tests. The 


communicator scale was distributed 
about midway through the course, and 
the communicant scale at the end, just 
before the close of the term. The ratings 
on group performance were administered 
after the completion of all group ses- 


sions. 
II. RESULTS 
A. Relationship between Communi- 
The 
total scores for our subjects on both the 


cator and Communicant Scales.* 
communicator and communicant scales, 
computed by the procedure described in 
the methodology section, revealed wide 
individual differences of self conception. 
The communicator scores ranged from 
24 to 111, with a mean of 75.4 and a 
24.5; 


communicant scores extended 


standard deviation of while the 
from 70 
to 137, with a mean of 113.2 and a stand- 
The 


of the communicator scale 


ard deviation of 15.1. lower mean 


is partly ac- 
shorter maximum 


counted for by its 


range, which was 0 to 126 by contrast 
the communicant 


scale, but even if an adjustment is made 


with o to 150 for 
for this difference of range, the mean of 
the communicator scale is substantially 
lower and the standard deviation much 
larger than for the communicant scores. 


4Whereas the number of subjects partic- 
ipating was 37, we were unable to obtain com- 
municator and communicant scales for one of 
the subjects. In the discussion of the group 
ratings, however, all 37 entered into the com- 
putations. 
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This difference between the two scales 
may be due entirely to the nature of the 
items included in the respective scales, 
but there are reasons to believe that for 
any random selection of items for the 
two scales, the communicator scale would 
generally show a lower average self-con- 
cept of ability and greater variation 
than the communicant scale. It appears 
that people tend to think better of them- 
selves as listeners than as speakers, and 
also that they are less able to discrim- 
inate between good and poor ability in 
listening than in speaking. It may be 
that people are not as accustomed to 
making judgments about, or receiving 
comments on, their communicant skills 
as they are about their communicator 
skills, and, further, they may be lacking 
tangible checks on their efficiency as 
communicants. 


The product-moment correlation be- 
tween the two scales was r = .38 (signif- 
icant at the .o5 level). That a higher 
correlation was not found suggests that 
self-evaluations of communicator and 
communicant abilities differ, but, fol- 
lowing our above line of thought, the 
low correlation may also be due to the 
fact that self-judgments of communicant 
performance may be more randomly dis- 
tributed than 
behavior. 


those of communicator 

B. Item Analyses. Although no items 
were left in the scale which failed to 
differentiate in some degree between low 
and high self-concept groups (above and 
below median scores), certain ones 
proved to be more discriminating than 
others. A few examples of differentiating 
the scale 


statements in communicator 


are listed below in Table I, together 


TABLE I 


EXAMPLES OF 


ITEM DISCRIMINATION FROM COMMUNICATOR QUESTIONNAIRE. 








Item 
In a discussion I prefer to let 
other people do the talking 


I often feel inhibited when I am expected to 
contribute something to a discussion 


In conversations and discussions, 
I “talk to the point” 


I have no trouble keeping 
conversations going 


I don’t let fear of saying the wrong thing 
stop me from saying what I mean 


It’s easy for me to persuade others 
to accept my point of view 


I find it easy to change my language if 

I see that my ideas are not getting across 
I often know what I want to say, 

but not how to say it 


When I talk I give a favorable 
impression of myself 


I have little difficulty putting 
complex ideas into words 


It wouldn’t bother me to have to deliver 
a speech before a large audience 


High Concept Low Concept 





Group ___ Group 
Low Low 
Concept Don't Concept Don't 
Choice Know Choice Know 
I 3 10 2 
1 2 9 1 
1 3 4 7 
3 1 10 I 
3 1 13 a 
3 5 11 5 
2 4 i. 2 
2 { 11 2 
1 4 4 11 
2 3 13 2 
5 4 16 1 
N = 18 N= 18 














24 SPEECH MONOGRAPHS 


with the relative scores of the high and 
low concept groups. We have selected 
for illustration those items which ap- 
pear to be most relevant to group dis- 
cussion, although one item on “stage 
fright,” reflecting another aspect of 
speech behavior in which self-concepts 
are important, has also been included. 


Analysis of these and other most dis- 
criminating items on the communicator 
scale indicated that the high and low 
concept groups differed greatly in self- 
confidence with regard to communicative 
ability and behavior, both generally and 
in specific situations. Further, low con- 
cept people appeared to characterize 
themselves as highly anxious about social 
status and as lacking the essential skills 
necessary to create a favorable impres- 
sion on others. They seemed to empha- 
size their paucity of ideational material, 
limited vocabulary, and inability to or- 
ganize and control their language. We 
suspect that clinical analysis would re- 
veal these people as high in self-abase- 
ment, highly deferential, and somewhat 
constricted in their approach to intel- 
lectual problems. 


While 26 of the communicator ques- 
tionnaire items were discriminating at 
the levels illustrated in Table I, only 


six of the communicant items were as 
effective. The results for four of these 
questions are presented in Table II. 
Because of overlap the other two are 
not presented. One gains an impression 
from these items and from introspective 
reports that the low concept person on 
the communicant questionnaire reveals 
himself as insensitive. He 
seems to be self-centered, insecure, and 
concerned primarily with the picture he 
creates in others rather than with the 


generally 


substance of their conversations. 


C. Background Factors and Question- 


naire Scores. Information concerning 
the amount and kind of parental em- 
phases given to communicative skills, 
supplemented by indications from the 
introspective reports, suggests that fam- 
ily influences play a part in determin- 
ing self-concepts with respect to com- 
municator skills. 

Position among siblings did not mark- 
edly affect self-concepts of communicator 
skill, although high self-concept people 
were more frequently first-born children. 
Parents of high concept children had 
slightly more education at the collegiate 
level than those of low-concept children, 
although the trend was slight. In general, 
parents of high concept children empha- 


TABLE II 
EXAMPLES OF ITEM DISCRIMINATION FROM COMMUNICANT QUESTIONNAIRE. 








Item 


I generally feel that people who are dull and uninteresting 


speakers aren’t worth listening to 


High Concept Low Concept 





Instead of listening to the other person, I often find 


myself thinking of what I am going to say 


Even if I don’t understand what a person means 


I try to give the impression that I do 


I have a habit of being overcritical 
of what other people say 


____ Group _ ; Group 
Low Low 
Concept Don't Concept Don't 
Choice Know Choice Know 
1 3 10 1 
° 4 7 2 
3 1 10 1 
6 2 14 o 
N= 18 N = 18 
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sized accurate phrasing of ideas, the 
importance of thinking before speaking, 
the importance of everyone expressing 
his opinions, and the importance of 
witty and interesting speech more than 
did the parents of low concept children. 
It is also apparent that the families of 
high concept people frequently talked 
about a variety of topics, including fam- 
ily matters, politics, abstract ideas, and 
other people. In addition, the verbal 
expression of high concept children was 
apparently less suppressed by their par- 
ents than was the case for low concept 
children. With the few cases at hand it 
is difficult to know whether these results 
are of any significance, but it would ap- 
pear that high concept children re- 
ceived more practice and training in 
communicative skills than did the low 
concept ones and practiced them under 
more favorable conditions. 

To some extent those with high con- 
cept communicant scores also experi- 
enced more emphasis and reward than 
did low concept people, but the results 
were not substantially different for the 
two groups. 


D. Relationships between Question- 
naires and Group Performance. Table 
III presents the between 
each of the scales and thirteen ratings 
of performance received by each mem- 


correlations 


ber from his group, including his own 
self-rating. Mean ranks of the subjects 
were converted to T scores before com- 
putation of the correlation coefficients. 
While the inclusion of these self-ratings, 
necessitated by assurance to the class of 
member anonymity, may have tended 
to inflate the correlations, Blake, Mou- 
ton and Fruchter’s finding of high con- 
sistency between the ratings of a mem- 
ber’s performance given by others and 
by himself suggests that this is not a 
serious error. 

Self-concepts of communicator skill 


TABLE Ill 


CORRELATIONS BETWEEN QUESTIONNAIRES AND 
RATINGS OF PERFORMANCE IN GROUP DISCUSSIONS. 








Communicator Communicant 

Behavior Rated Scale Scale 
Contribution to 

Group Decision 59 OF 
Contribution to 

Understanding 46 Ol 
Clerical Work 18 —.02 
Leadership 68 .20 
On the Ball 57 13 
Agreeable 

and Willing —.32 —.O4 
Dominant 32 22 
Desire to do 

Good Job 2 07 
Cooperative .09 Ol 
Satisfaction with 

Group Decisions —.07 —.01 
Frustration —.28 . 38 
Interest .48 OF 
Yielded —.33 02 


Correlation Between Communicator and 
Communicant Scales: .98 








were related positively and substantially 
with the ratings received for their (1) 
contribution to group 
contribution to the understanding of 
the theorist, (3) leadership, (4) ability 
to keep the group on the ball, (5) 


decisions, (2) 


desire to do a good job, and (6) interest 
in the task. Introspective reports of the 
subjects are in agreement with the rela- 
tions suggested by the negative correla- 
tions, namely, that high concept sub- 
jects were not considered the most agree- 
able and willing, nor were they the peo- 
ple who yielded most to the others in 
the group. 


Except for the low positive correla- 
tion with frustration, there appears to 
be little relationship between communi- 
cant scores and performance in group 
discussions. These low correlations may 
reflect either the inability of the sub- 
jects to assess themselves on communi- 
cant behavior or the insensitivity of the 
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communicant scale. It is also possible 
that the ratings do not lend themselves 
to revealing the contributions which 
good communicant behavior can make 
to group discussions. This may be due 
to the nature of the rating items them- 
selves, or to the difficulty of assessing 
communicant behavior apart from com- 
municator behavior. Those who listen 
well but provide few cues of their 
understanding to the group would prob- 
ably receive low ratings on these items 
from the other members, and thereby 
result in a low or negative correlation 
between these items and the communi- 
cant scale. Perhaps rating items such as 
the following might relate more highly to 
the communicant scale: who gave ideas 
the fairest hearing? who most helped 
the group to maintain continuity in 
its thought process? who did most to 
encourage contribution from others? 

An _intercorrelational analysis was 
made of those items reported by Blake, 
Mouton and Fruchter the most 
reliable.’ They reveal high intercorrela- 
tions (averaging approximately .85) for 
all variables except dominant behavior, 
which is positively related to the others, 
but to a lesser degree (approximately 
-55). It is conceivable that these items 
are all measuring one variable, namely, 
either amount of participation § or 
amount of leadership. Additional light 
may be thrown on this question by the 
Air Force study where other measures 
have been obtained. 


to be 


III. Discussion OF RESULTS 


Our data support the hypothesis that 
a person's self-concepts of his communi- 
cative skills are related to his perform- 
ance in discussion groups. We are not 
certain that our failure to unearth signif- 


5 Contribution to group decisions, contribu- 
tion to the understanding of the _ theorist, 
leadership, keeping the group on the ball, 
dominant, desire to do a good job, and interest. 


icant relationships with the communi- 
cant questionnaire indicates the absence 
of any relationships, since this instru- 
ment appears to be less discriminating 
and sensitive than the communicator 
scale. In view of our previous remarks 
concerning the lack of habituation to 
judgments of communicant skill, we 
think it would be interesting to deter- 
mine whether sensitivity to communicant 
behavior can be increased. Our current 
study of students in speech classes is di- 
rected toward this question. It may well 
be that sensitivity to communicant be- 
havior can only be increased as people 
in general are trained to value listening 
skills more highly. 


The tentativeness of our results should 
again be emphasized in view of the limi- 
tations of this pilot study. Because of 
the small sample, separate analyses of 
sex differences were not attempted, al- 
though these may well exist. In addition, 
neither the number of interactions nor 
the population characteristics were con- 
stant for the three groups. In addition 
it will be noted that each person was 
judged in the context of his group only, 
so that variations in group composition 
may have seriously affected the ratings. 
Attempts were made, however, to com- 
pensate for this effect by employing a 
relative index of rank within the group. 
An interaction process analysis was not 
employed in this pilot study but such a 
procedure is planned for later investi- 
gations. 


IV. SUMMARY 


A pilot study was undertaken of 37 
students enrolled in an advanced under- 
graduate course in the psychology of 
personality to explore the relationship 
between self concepts of communicative 
skill and performance in group discus- 
sions. Two questionnaires, dealing with 
a person’s skill as communicator and 
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communicant, together with some bio- 
graphical questions were distributed to 
all subjects. Ratings of performance in 
three discussion groups were obtained, 
in which each member of a group was 
rated by himself and his entire group. 
Correla’’ »nal and categorical analyses 
revealed significant 


positive relation- 


ship between self-descriptions of com- 
municator skills and performance in 
group discussions, particularly in leader- 
ship, contribution to group decision, 
desire to do a good job, and keeping 
the group on the ball. The communicant 
questionnaire did not show any signif- 
icant relationships to the same variables. 








A STATISTICAL FORMULA TO QUANTIFY THE 
“SPREAD-OF-PARTICIPATION” 
IN GROUP DISCUSSION 
MILTON DICKENS 
University of Southern California 


—aemenggelien research in group 
discussion during recent years has 
indicated among other things that there 
is a need for complete and accurate re- 
cordings of what is said during discus- 
sions, and a need for techniques in 
gathering and manipulating such data. 
In connection with the latter, there is a 
special need for techniques by which to 
evaluate group discussions. In compar- 
ing two or more discussions, we want to 
know which was more effective. By 
what standards should such comparative 
effectiveness be judged? Must such 
judgments be subjective? 

A partial answer to these needs may 
be found through a consideration of 
what might be termed “the spread of 
participation.” Most authorities in the 
field state or imply that all the members 
of a group should be brought into a 
discussion; that it is bad practice to al- 
low an over-talkative member to monop- 
olize the time. Thus, the comparative 
extents to which members of a 
group share the talking, would seem 


to be a reasonably valid measure of one 


the 


aspect of the over-all effectiveness of a 
discussion. Such a measurement has the 
additional advantage that, where tran- 
scripts of discussions are available, it is 
completely quantitative and objective. 

With the foregoing in mind, several 
years ago the writer devised a statistical 
formula which has proved useful and 
may be of interest to other experiment- 
ers. The logic underlying the formula 
may be described as follows. Theoreti- 


cally, the “worst” possible spread-of- 


participation would be a discussion in 
which one member did all the talking. 
For example, in a discussion group of 
five persons in which a total of 5,000 
words were spoken, suppose that one 
member spoke all 5,000 words while 
each of the other four members spoke 
zero words. To balance this, a theoreti- 
cal “best” was assumed to be a discus- 
sion in which the total number of words 
was exactly evenly divided. In the prev- 
ious example, this would amount to 
1,000 words for each of the five partici- 
With the _ theoretical 
thus established, it was thought that for 
convenience a 
that 
duce a score of zero and the other ex- 


pants. extremes 


formula should be so 


devised one extreme would pro- 


treme a score of 1.00. There were, of 
course, several ways by which this could 
be achieved. The simplest seemed to 
be the following: 

b (c—d) + Sx — Sy 

SP = 


2 (a—b) 
In this formula 
SP = spread-of-partic ipation score 


a — total number of words in the 


discussion. 
a 
b= —— 
number of discussants 
c = number of discussants with 


words < b 


d = number of discussants with 
words > b 


=x = sum of words of discussants > b 


Sy = sum of words of discussants < b. 
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Applying this formula to our hy- 
pothetical “worst” case described above, 
we get the following: 


1000 (4—1) + 5000 —o 





SP = ——_—-—_____—— 
2 (5000 — 1000) 
sp 8000 
~~ 8000 
SP = 1.00 


Similarly, if we apply the formula to 
our “‘best’’ case, the score becomes zero. 

To secure a more typical picture 
of how the formula works, it may next 
be applied to an actual discussion tran- 
script. In this discussion there were 
four participants who spoke the follow- 
ing numbers of words respectively: 3,320, 
2,962, 2,380 and 1,137, making a total of 
9-799 the 


\pplving the formula: 


words for whole discussion. 


The distribution of scores was 
fairly normal as can be seen from the 


frequency table shown as Table I. 


cases. 


‘The approximate mean for scores in 
Table I was .331; the standard devia- 
tion was .113; the standard error of the 
mean was The fact that these 
scores tended to group toward the lower, 
i.e., “better” end of the scale, was prob- 
ably explained by the fact that 43 of 


-O17. 


the 45 groups were college students from 
speech classes, and that many of the 
chairmen had had discussion training. 
Each of the groups had a chairman 
whose words were counted in the scor- 
ing. Size of the groups varied from 4 to 
22 members. Type of discussion was 


informal, problem-solving. Discussion 


topics were realistic for the participants: 
football, marriage, 


war preparation. 


_ 2450 (2—2) + (3320 + 2962) — (2380 + 1137) 


SP = — 


2 (9799 — 2450) 


SP — .188 


The mathematical proofs for the for- 
mula can readily be established by any- 
one who wishes to follow through the 
described logic. 

‘To give some notion of what may be 
expected by way of norms when this 
formula is applied, results for 45 experi- 
mental discussions will be briefly pre- 
sented. This series was carried out and 
transcribed under the writer's super- 
vision’ and the formula applied to all 


TABLE I 
FREQUENCY DISTRIBUTION OF 
“SPREAD-OF-PARTICIPATION” SCORES FOR 
45 EXPERIMENTAL GROUP DISCUSSIONS. 








Scores f 
-5O-.59 4 
.40-.49 10 
-30-.39 14 
.20-.29 11 
-10-.19 5 
.00-.09 1 








1The series was made possible through a 
Grant-in-Aid from the Social Science Research 
Council. 


One group was of business people; one 
of faculty members. Fortunately, the 
faculty group (which in theory should 
have been the most efficient) had the 
best score, .0g8. The worst score was 
.596. 

The reliability of the foregoing tech- 
nique is obviously high. Thus, if several 
independent experimenters were to ap- 
ply the formula to the same transcript, 
variations the experimenters 
would be only those due to errors in 


among 


counting or in doing the mathematics. 


The validity of the technique must be 
evaluated with caution. Does the form- 
ula measure what it purports to meas- 
ure? Is the spread-of-participation one 
index of the efficiency or goodness of a 
group discussion? The ultimate answer 
must await further research. To guide 
such research some a priori evaluation 
of the formula seems justified. The logic 


establishing the worst theoretical score 
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would probably be rather unanimously 
agreed upon. The logic establishing the 
best score, however, should probably be 
qualified. Most persons who have had 
practical experience with group discus- 
sions would probably agree that, ex- 
cept for purposes of classroom practice, 
a precisely equal amount of participa- 
tion by members of a group is not neces- 
sarily an ideal to be sought. Rather, the 
discussants should contribute in propor- 
tion to their having something worth- 
On the other hand, all 
participants should probably contribute 
at least a there 
would be no basis for deciding who had 
something worthwhile to say. Also, if 
some members of a group do not under- 


while to say. 


minimum—otherwise 


stand the discussion topic well enough 
to justify using their share of the group’s 
time, that may often be a legitimate 
criticism of the whole meeting since 
it reflects faulty planning or faulty ad- 
vance preparation. Another objection 
to the validity of the formula is that 
individuals vary in the number of words 
used to express any given idea, e.g., a 
discussion may be good despite the fact 
that some members are wordy and others 
are succinct. On the other hand, it 
might be urged that individual differ- 


ences usually follow a normal curve and 
that extremes of either wordiness or 
brevity reduce the effectiveness of a 
discussion group. Other objections to 
the validity of the formula might be 
raised and debated. A resolution of the 
problem will require experimental test- 
ing of at least two hypotheses: (1) That 
the spread-of-participation is a_ valid 
measure of one aspect of the effective- 
ness of group discussion; (2) That the 
spread-of-participation correlates signif- 
icantly with other measures, and is a 
valid measure of the over-all effectiveness 
of group discussion. 

The feasibility of the formula may be 
questioned on the grounds that record- 
ing and transcribing discussions require 
considerable time and money. For pur- 
poses of certain needed research proj- 
ects the foregoing difficulties must be 
accepted. For other purposes, however, 
at least two shortcuts deserve to be ex- 
plored: (1) counting the number of 
speeches rather than the number of 
words, and (2) 
with a stop watch, thus substituting 
number of minutes for number of words. 
The possible usefulness of such short- 
cuts poses another problem for future 
research. 
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AN EXPERIMENTAL STUDY OF THE EFFECTS OF 
TRAINING IN THE RECOGNITION AND 
FORMULATION OF GOALS UPON 
INTRA-GROUP COOPERATION* 


R. VICTOR HARNACK 
University of Colorado 


© pw of the most significant problems 
concerning discussion process is that 
of the inter-personal relationships that 
develop within the group. Discussion 
authorities usually argue that a cooper- 
ative relationship is the most desirable.’ 
Experimental evidence tends to substan- 
tiate this argument.* This is not to argue 
that cooperation is superior to competi- 
tion, per se, but in the discussion group 
it does seem to be the case. 


If, then, cooperation can be assumed 
to be the ideal, it is necessary to develop 
techniques of producing it. The primary 
object of this study was to develop and 
test a method of training individuals to 
create more cooperative relationships. 
First, however, the premises upon which 
the training was based need to be ex- 
plained. 

A. Cooperation and Competition. The 
essence of the theory concerning cooper- 
ation and competition which has been 
developed here is based upon the experi- 


ments conducted by 


Morton Deutsch. 


*Based upon Ph.D. dissertation, Northwestern 
University, 1954, directed by Franklyn S. Hai- 
man. 

icf. James H. McBurney and Kenneth G. 
Hance, Discussion in Human Affairs (New 
York, 1950), p. 7- 

2 Note particularly such studies as: Morton 
Deutsch, “The Effects of Co-operation and 
Competition Upon Group Process,” (Unpub- 
lished Ph.D. dissertation, Massachusetts Insti- 
tute of Technology, 1948); Alexander Minz, 
Jour. of Abnormal and Sociol. Psych., XLVI 
(1951), 150; Peter M. Blau “Co-operation and 
Competition in a Bureaucracy,” American Jour. 
of Sociology, LIX (1954), 530; and Taisuke 
Mizuhara, “Experimental Studies of Coopera- 
tion and Competition,” Japanese Jour. of Psy- 
chology, XXIII (1952), 170. 


He distinguishes between a cooperative 
situation and a competitive situation as 
follows: In a cooperative situation the 
individuals involved can reach to some 
degree their respective goals only if all 
the individuals in the situation can also 
reach their goals. Deutsch 
“promotively 


calls such 
goals interdependent 
goals.” In a competitive situation the 
goals are such that an individual can 
reach his goals to some degree only if 
the others do NOT reach their goals. He 
designates these as “contriently interde- 
pendent goals.” 


‘These descriptions do not define co- 
operation and competition, but they de- 
scribe the social situations that give rise 
to cooperative and competitive relation- 
ships. Athletic contests provide excellent 
examples. In a football game, for in- 
stance, the members of one team are 
promotively interrelated with respect to 
winning the game while the two teams 
are contriently interrelated with respect 
to that same goal. 


For the purposes of this study, a coop- 
erative relationship will be defined as 
one which exists when two or more indi- 
viduals perceive that their respective 
goals are promotively interdependent. 
A competitive relationship will be de- 
fined as one which exists when two or 
more individuals perceive that their re- 
spective goals are contriently interdepen- 
dent. (The nature of cooperative and 
competitive behavior which can be ex- 


pected to ensue will be discussed when 
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the measuring instruments used in the 
experiment are presented.) 

It should be remembered that coop- 
eration and competition are only names 
given to an inferred relationship. One 
cannot observe this relationship but on- 
ly acts that may be indices of the pres- 
ence of the relationship. This relation- 
ship, in turn, is taken to be a function 
of the fashion in which the individuals 
involved perceive their goals. 

B. Goals. 
mean any condition which, once attained 


A goal will be taken to 
could 
that 
individual is 


by an_ individual, conceivably 


reduce tension within individual. 


When an faced with a 


problem situation (a tension unre- 
solved) he selects a goal and directs his 


efforts toward that goal. 
Two things need to be noted immed- 
iately last 


individual 


concerning this 


The first 


statement. 
is that the selects 
certain conditions (goals) from a poten- 
tially large field. The second is that an 
individual is faced (in a complex situ- 
ation such as a discussion group meet- 
ing) with a complex multiplicity of prob- 
lems. In addition to the large prob- 
lem(s) facing the group, the individual 
must solve a host of smaller problems 
arising from his interaction with the 
others, and the itself 


may be broken down into several small- 


main problem 


er sub-problems. 


As has been previously indicated, the 
nature of the goal influences an individ- 
ual’s relationships with others. Hence, 
the question of whether it is possible to 
restructure goal patterns in order to 
create cooperative relationships becomes 
pertinent. 

The training procedure tested by this 
study was based upon the assumption 
that it is possible for an individual to 
do something about his goals in a dis- 
situation. An 


cussion individual may 


usually do one or the other of the follow- 


ing whenever he discovers that he is 


pursuing 


g  contriently 


y interdependent 
goals. (a) He may compromise some 
goals in order to attain other (presum- 
ably more important) goals which are 
not contriently interdependent; (b) he 
may select a different goal which will 
alleviate the problem but which will not 
be contriently interdependent. 


the 
designed to: (a) sensitize the individual 


Thus, training procedure was 
to the problem of goals; (b) give him 
insight into the fashion in which goals 
affect interpersonal relationships, and 
(c) develop skill in restructuring his own 
goals and those of others in the discus- 
sion group. It seemed as if this was the 
most direct means of attacking the prob- 
lem of developing cooperative relation- 
ships. 


I. ‘THE EXPERIMENTAL HYPOTHESES 


Iwo hypotheses were set up for test 
ing. These hypotheses represent the a 
priori assumptions which the experi- 
ment was designed to test. 

1. Groups of individuals who have 
received training designed to develop 
insight into the functioning of goals, 
and skill in restructuring goals, will co- 
operate more effectively than groups of 
individuals who have not received such 
training. 

This hypothesis is designed simply to 
Will the type of 
training employed in this study produce 


answer the question: 
a difference in behavior? If the answer 
to this question is affirmative, it is possi- 
ble to assert that cooperation can be 
taught. The hypothesis does not pro- 
vide for comparisons with other methods 
of training. 

2. The amount of goal-setting be- 
havior in discussion groups will be posi- 
tively correlated with the degree of co- 
operation in those groups. 


This hypothesis predicates a relation- 
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ship between the amount of attention 
given to the problem of goal-setting and 
the degree of cooperation. The testing 
of this hypothesis is dependent upon 
means for measuring the amount of 
goal-setting behavior as well as means 
for measuring the degree of cooperation. 
In essence, this hypothesis suggests a 
means for indirectly testing the theory 
of goals presented in the earlier portion 
of this paper. 

I]. ‘He EXPERIMENTAL SITUATION 
AND PROCEDURE® 


A. The Experimental Design.t Twen- 
ty groups of approximately five subjects 
each were used for the experiment. Ten 
of the groups were observed in a dis- 
cussion situation prior to the training 
period, and the remaining ten groups 
were observed after training. All of the 
subjects were given the training proced- 
ure during the five day interval. The 
from 


subjects were drawn beginning 


classes in group discussion taught dur- 
ing the Spring Quarter of 1954 at North- 
western University. (The experiments 
took place approximately at mid-term.) 
Each of the 20 test groups was composed 
of subjects drawn from the same class. 
This design is based upon the assump- 
tion that the subjects who did not re- 
ceive the training before observation 
(UT subjects) were equivalent in rele- 


3A pilot study was performed, prior to the 
study reported here, which used eight groups. 
Four groups were given a pre-test and a post- 
test with a three-day training period interven- 
ing. The other four groups were used as 
controls, They were given a pre-test and a post- 
test with no intervening training. 

*This experimental design is called by 
Campbell the “offset before and after design.” 
It was used in preference to the traditional 
pretest-posttest design for several reasons which 
included: the ease of training all the subjects 
since they were all enrolled in discussion classes 
at the same time, the opportunity of using a 
discussion problem which involved playing a 
hoax upon the subjects, and certain scientific 
reasons which are discussed by Donald T. 
Campbell, “Designs for Social Science Experi- 
ments,” unpublished MS., U. of Chicago, 1953. 


vant factors to those who did receive the 
training prior to observation (TR sub- 
jects). ‘This assumption was tested as 
follows: The final class grade in discus- 
sion for each of the subjects was ob- 
tained. (Since the UT subjects actually 
received the same training as the TR 
subjects, their final grades were compara- 
ble.) The grades were translated into 
numerical units on the basis that A = 5, 
B= The mean difference of .17 
was treated statistically by means of the 
“te” test.5 The “t”’ 


4, ete. 


value of .773 could be 
expected to occur 50 percent of the time 
by chance alone. Thus, it is possible to 
assume that the two populations were 
sufficiently similar for comparative pur- 
poses. 

B. The Testing Procedure. The test- 
ing situation consisted of a 40 minute 
period during which the group discussed 
a problem dealing with racial discrimi- 
sororities at 


nation in fraternities and 


Northwestern. ‘The subjects were told 
that their group recommendations would 
be printed, together with their signa- 
tures, in the Northwestern Daily. This 
was done in order to produce involve- 
ment and to make the experimental 
situation as “lifelike” as possible. ‘This 
hoax was explained to the subjects im- 
mediately after the completion of the 
observation session and they all agreed 
to keep the matter a secret. 

Three instruments were used to gather 
data from the discussions. The first was 
an eleven (OB) 
which was filled out jointly by two ob- 
servers at 


item questionnaire 


the close of the discussion.® 


All of the items represented behaviors 


which Deutsch found distinguished 
5See any standard statistics text for a 
theory of the “t” test of significance and the 


formulas for calculating it. 
6 Four observers, operating in teams of two, 


were used. All were undergraduates with at 
least three courses in discussion methods and 
were chosen upon the basis of outstanding 
ability. 
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between cooperative and competitive 
groups at a significant level. Each item 
was rated on a scale of values ranging 
from 1 to g with the higher scores indi- 


cating a greater degree of cooperation. 

The second instrument was a ten item 
questionnaire (Q) which was filled out 
independently by each of the subjects 
at the close of the testing session. A 
similar scale of values was provided, and 
again all items represented behaviors 
which Deutsch found distinguished at 
a significant level. 

The third instrument was a five cate- 
gory interaction analysis (IA) which was 
developed by the experimenter and the 
observers especially for this study. The 
first three categories were designed to 
classify types of goal behavior; the 
fourth, residual positive behavior; and 
the fifth, residual negative behavior. ‘The 
two observers were instructed to classifs 
all verbal contributions. They scored 
independently, and the mean in each 
category was used as the unit for com- 
parison. 

Reliability for this instrument was cal- 
culated by use of a modification of the 
chi-square “goodness of fit’ technique 
developed by Bales to test reliability for 
his interaction analysis.? “The object was 
to determine whether the totals scored 
by each observer in each of the five cate- 
gories differed significantly from each 
other. Only one of the go sets of obser- 
vations revealed a statistically signifi- 
cant difference. Eleven of the probabili- 
ty levels equalled or exceeded .50 which 
is the level that Bales defines as accepta- 
ble. Thus, it seems reasonable to con- 
clude that the reliability level for the 
entire set of groups was sufficiently high 
to warrant reasonable confidence in the 
data. 


In addition to these three instruments 


7 Robert F. Bales, Interaction Process Analy 
sis (Cambridge, Mass., 1951), pp. 100 ff 


a questionnaire was administered to the 
trainers (exclusive of the experimenters) 
which was designed to obtain their re- 
actions to the theory and the method of 
training. 

C. The Training Procedure. Six in- 
dividuals, including the experimenter, 
acted as trainers in the experiment. 
Three were the teachers of the courses 
from which the subjects were drawn, 
and two were teaching assistants in one 
of the courses. In order to maximize the 
opportunities for practice, the training 
groups were held to approximately ten 
subjects in each. 

Five training sessions of 50 minutes 
each were used. (One class met for three 
50 minute sessions and one go minute 
session.) The essential theory was de- 
veloped by the experimenter into a 12- 
page manual which was distributed for 
student use. With the exception of the 
second day, the training was conducted 
through some modification of the dis- 
cussion procedure. 
the 


(On the second day 
delivered a lecture based 
upon the theory of goals.) Role-playing 
sessions used the fourth and 
fifth days in order to develop skill in 
resolving conflict situations. 


trainer 


were on 


Training of this nature could not be 
rigorously controlled, but the objectives 
for each session were made explicit for 
the trainer who provided guidance for 
the discussions. 


Ill. QUANTITATIVE RESULTS 


A. Hypothesis 1: This hypothesis 
stated that groups of trained individuals 
would cooperate more effectively than 
untrained groups. The familiar “t” test 
of significance was applied to determine 
whether the observed 
greater than chance.® 


differences were 


8 For the particular formula used see Allen 
L. Edwards, Statistical Analysis (New York, 
1946), p. 177. 
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the First comparisons were made be-_ the results obtained from OB, the UT 
ers) | tween the trained (TR) groups and the groups tended (with two exceptions) to 
re- untrained (UT) groups using data from rate themselves as somewhat more co- 
lof | the observers’ questionnaire (OB). The operative than the TR groups. The 
first comparison employed the total differences were not, with the exception 
in- scores on OB for each group. The first of the difference relative to “quality of 
iter, item in Table I shows that the mean product,” large enough to be significant. 
ent. difference was 16.7 in favor of the TR Table II contains the relevant data ex- 
rses groups. This difference is significant at cluding questions 2 and 3 which needed 
wn, the .o5 level of confidence. a different treatment. 
one Each of the eleven questions on OB Questions 2 and 3 attempted to obtain 
the was then tested separately. All of the data pertinent to the degree of coopera- 
1ing differences favored the TR groups, but tion by a somewhat different means. 
ten not all were significant. The items Question 2 asked, “How many sugges- 
revealing significant differences were tions did you have which (for any rea- 
utes “group productivity,” “group-centered- son) you did not make to the group?” 
hree ness,” and “ease of communication.” Question 3 asked, “If answer to number 
nute A different picture appears when the 2 is anything but ‘O’, why didn’t you 
de- results from the subjects’ questionnaire make these suggestions?” Some of the 





| 12- (Q) are examined. In direct contrast to reasons indicated competitive relations, 
| for 


























\ rABLE I 
the COMPARISONS BETWEEN TR AND UT Groups Usinc OB Data. 
icted Mean of TR Mean of UT 
dis- Question Groups Groups M Diff = 
day Total Score (all questions) 60.3 43.6 16.7 2.23° 
vased | 1. Productivity 5.0 2.8 2.2 2.62** 
. | 2. Systematic Approach 4:7 3.1 1.6 1.91 
ying g. Attention to Problem 5.5 4.2 1.3 1.39 
and | 4a. Friendliness 5.3 4:7 6 66 
n in | 4b. Group-Centeredness 5-7 3.5 2.2 sa. 
4c. Ease of Communication 6.6 4.9 1.7 2.21* 
4d. Sense of direction 5.2 3.7 1.5 1.63 
4e. Coordination of Effort 5.2 3.7 1.5 1.58 
»t be 4f. Attention to Each Other 6.3 B.A 9 .98 
tives 4g- Achievement Pressure 5-2 3.3 1.9 1.61 
4h. Acceptance of Others’ Ideas 5.6 4-9 D 1.27 
t for 
e for *Significant at the .os level. 
**Significant at the .o2 level. 
***Significant at the .o1 level. 
rABLE Il 
COMPARISONS BETWEEN TR AND UT Groups Ustnc Q DATA, QUESTIONS 1, 4, 5, 6, 7, 8, 9, AND 10. 
thesis Mean Mean 
i of TR of UT 
duals Question Groups Groups M diff 
than - ; — - 
a Total Score (all 8 questions) 51.514 55-149 — 3.635 1.41 
test 1. Group Helped Your Thinking 4.751 5.612 — 861 77 
‘mine 4. Ease of Communication 7-159 7.470 —.411 1.11 
5. Degree of Attentiveness 6.851 6.745 106 17 
were 6. Interest in the Problem 7.480 7.835, — .355 1.04 
7. Amount of Group Feeling 6.292 6.go02 —.610 1.49 
Degree of Cooperation 6.860 7.342 —.482 1.02 
9g. Quality of Product 5.876 7-152 — 1.276 2.19° 
» Allen 10. How Little at Odds with Others 6.245 6.091 15 35 
York, 
*Significant at the .o5 level. 
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others cooperative relations, and others 


were neutral. (None were so identi- 
fied on the forms given to the subjects.) 
The TR groups noted omission of a 
significantly larger number of total sug- 
gestions than the UT groups; they gave 
a larger (but not significant) number 
of reasons for omissions; they offered 
more cooperative and neutral reasons 
(not significant); and they offered signi- 
ficantly more competitive reasons. 

The next set of comparisons were those 


(1A) 
data. Apparently because of a livelier 


made using interaction analysis 


exchange of comments within the 


TR 
with a significantly 


groups, the groups were credited 
larger number ol 
total interactions. The IR groups were 
scored as having made a larger total of 
goal-oriented contributions, but a small- 
er proportion of their contributions fell 
in the three goal-behavior categories.® 
Neither difference was significant. 

B. Hypothesis 2: This hypothesis 
stated that there would be a positive cor- 
relation between the amount of goal- 
behavior and the degree of cooperation 
present in the discussion groups. This 
was tested by correlating the amount of 
goal-behavior recorded by IA (categories 
1, 2, and 3) with the “total cooperation” 
scores from OB and then Q. The corre- 
lations first using IA proportion scores 
and then IA the OB 
scores showed positive correlations as 
predicted by the hypothesis. However, 
the correlations were only .10 and .42 re- 


raw scores with 


spectively. Negative correlations of —.11 
and —.34 were found when the same 
two correlations were calculated using 


Q data ‘otals. 


IV. AND 


A. Sources of Error. 


CONCLUSIONS INTERPRETATIONS 


Betore examin- 


9 Proportion scores were calculated by di- 
viding the total number of verbal contributions 
scored for the group into the number of con- 
tributions which were classified in the three 
goal categories. 


ing the data, some of the more impor- 


tant sources of error and bias should 
be noted. One of these may have been 
the possibility that the training proce- 
dure operated to “cue” the subjects as 
to what was expected of them in the 
testing situations. There is no outward 
evidence that this did happen, and even 
had it been the case, the subjects would 
still have had to acquire some skills to 
perform in such a fashion. Moreover, 
the subjects were assured that their test- 
ing performances would have no _ in- 
fluence upon their course grades. ‘Thus, 
they had no apparent reason for respond- 
ing in anything but a “natural” fashion 
to the testing situation. 

The time lapse between the observa- 
the UT TR 
groups may have affected the observers’ 


tions of groups and the 


reactions. Also it should be noted that 
the observers knew, of necessity, which 
groups had been trained. There is no 
evidence to indicate that either operated 
to the disadvantage of the UT groups. 
In fact, it may be as reasonable to sup- 
that 
backward” to be fair. 


pose the observers “leaned over 

B. Conclusions Regarding Hypothe- 
sis 2: The first problem is that of ac- 
counting for the contradictory results 
obtained by OB and Q. The explana- 
tion which seemed most plausible’ was 
that the UT groups were neither as dis- 


criminating nor as rigorous as the TR 


groups in their evaluations of them- 
selves. This might well have been a 
function of the greater insight into 


group processes which it was hypothe- 
sized would result from training. 

In order to test this interpretation, a 
run for all 20 groups 
(both TR and UT) between the sums 
of the ratings for the eight questions 
(excluding questions 2 and 3) on Q and 


correlation was 


the sums of the ratings for the eleven 
questions on OB. A second correlation 
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was performed, using the rating totals 
for the UT groups only, on Q and OB; 
and a third correlation was computed 
between the rating totals for the TR 
only Q OB. The 
of correlations were 


groups on and 


same three kinds 
run using those individual questions 
from Q and OB that were comparable. 


(There were six of these.) The results of 


More evidence in support of the argu- 
ment that the TR self-ratings were more 
rigorous is found in the fact that the 
TR subjects recorded a_ significantly 
larger number of suggestions not made 
and offered a substantially larger num- 
ber of reasons. Perhaps more noteworthy 
is the fact that none of the UT subjects 


added any reasons of their own to the 


these computations may be found in ones provided on the questionnaire to 
Table III. explain any failures to make suggestions 
TABLE III 


CORRELATIONS BETWEEN Q DATA AND OB DATA. 














r ALL cur r TR 

Items of Comparison Groups Groups Groups 
© (totals) and OB (totals) 076 —.230 465 
Q- 4 and OB-4c (ease of communication) O81 472 063 
()- 5 and OB-4f (attentiveness to others) —.139 —.00% —.335 
Q- 7 and OB-4b (degree of group feeling) —.102 —.414 404 
Q- 7 and OB-4e (degree of group feeling) 

from O and working together on OB) 165 —.102 433 
QO- g and OB-1 (productivity) O34 —.236 734* 
Q-10 and OB-4a (lack of competitiveness 

on Q and friendliness on OB) .229 —.050 464 
*This r differs significantly from the r of —.263 at the .o5 level. 


The results of these operations showed 
that the TR self-ratings tended to corre- 
late more highly the observers’ 
ratings than the UT self-ratings. It is 
particularly 


with 


interesting to note, when 
the question of productivity was com- 
pared (OB 1 Q 9g), that the correlation 
coefficient obtained from TR self-ratings 
compared with OB ratings was .734, and 
the correlation coefficient obtained by 
comparing UT self-ratings with OB rat- 
ings was —.263. The difference between 
the two correlation coefficients was sta- 
tistically significant at the .o5 level.'° 
(Question g on Q had produced the 
only statistically significant difference in 
favor of the UT groups. Similarly ques- 
tion 1 on OB had produced a statisti- 
cally significant difference in favor of 
the TR groups.) 

10 For the procedure of testing to determine 
whether two correlation coefficients differ sig- 
nificantly from each other, see George W. 


Snedecor, Statistical Methods (Ames, Iowa, 
1946), Pp. 151. 


to the group. On the other hand, the 
TR subjects offered a total of 
additional reasons. 


nine 


The first phenomenon noted with re- 
spect to the IA data is that the TR 
groups were credited with a significantly 
larger number of total interactions. This 
seems strange since all groups discussed 
for approximately the same length of 
time. The discrepancy in totals may 
have been a function of several varia- 
bles. One may have been the tendency 
of the TR groups to prolong the discus- 
sion until stopped by the experimenter. 
Probably more important, however, is 
the likelihood that the interchange was 
livelier in the TR groups. This may be 
accounted for by the possibility that the 
TR subjects may have felt more secure 
in the testing situation since they had 
passed through a period of training prior 
to the testing. 

Considering the qualifications and in- 
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terpretations discussed above, the fol- 
lowing conclusions regarding Hypothe- 
sis 1 seem warranted: 

1. The subjects’ questionnaire (Q) 
does not seem to be particularly useful 
for comparative purposes. The data ob- 
tained from the interaction analysis (IA) 
seem somewhat ambiguous, but they 
appear to have a restricted amount of 
use. The observers’ questionnaire (OB) 
seems to be but caution 
should be exercised in interpreting the 
results. 


most valid, 


2. Because of the correlation between 
their self-ratings and the observers’ rat- 
ings, and because of their apparently 
greater awareness of suggestions omitted 
and the reasons for the omissions, the 
trained (TR) subjects appear to have 
possessed a greater degree of insight 
into process factors than the untrained 
(UT) subjects. 

3. Relying primarily upon the results 
of the IA data, it cannot be said that the 
TR groups cooperated more effectively 
than the UT groups, although the gen- 
eral tendency is in this direction. 

4- Relying primarily upon the results 
of the OB data, it appears as if the TR 
groups cooperated more effectively than 
the UT groups. 

5. Thus, it seems safe to assert that, 
although hypothesis 1 has not been un- 
equivocally confirmed, it has received 
support from the data presented. 


B. Conclusions Regarding Hypothe- 
sis 2. This hypothesis was phrased: “The 
amount of goal-setting behavior in dis- 
cussion groups will be positively corre- 
lated with the degree of cooperation in 
those groups.” Little support for this 
hypothesis can be found in the data pre- 
sented. An argument may be advanced 
that the interaction an analysis was too 
insensitive to detect all types of goal- 
behavior, but there is no evidence for 
this position. 


C. Implications of Findings. In this 
regard it should be noted that all the 
trainers regarded the goal-oriented ap- 
proach as a feasible method of working 
toward the production of intra-group 
cooperation. Several intended making 
modifications of varying degrees in their 
method of process training as a result 
of their experiences with this study. 

Upon the basis of the qualitative evi- 
dence offered by the trainers and the 
quantitative data produced by the ex- 
periment, it appears that the goal-ori- 
ented approach to process training out- 
lined in this paper deserves the attention 
of discussion trainers and merits further 
investigation as a_ teaching method. 
While this statement must be qualified, 
it appears that the trained groups did 
cooperate more than the untrained 
groups and this appears to have been a 


function of the training procedure. 
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THE INDEX OF AGREEMENT: 


A POSSIBLE CRITERION 


FOR MEASURING THE OUTCOME OF GROUP DISCUSSION 


JACK MATTHEWS and A. W. BENDIG* 
University of Pittsburgh 


— paper is concerned with devel- 
oping a method of quantifying one 
of the many possible outcomes of the 
discussion process. The outcome we are 
interested in measuring is agreement 
among discussants. Although most stu- 
dents of discussion probably have simi- 
lar concepts of “agreement” we feel it is 
desirable to define the term operation- 
ally before proceeding with a descrip- 
tion of its measurement. 

Let us assume a group made up of 
only two individuals, A and B. Let us 
further assume they discuss a problem 
which has only two alternative solutions, 
I and II. If individuals A and B both 
select the same solution (both select so- 
lution I or both select solution II) we 
can say that A and B “agree” on the 
solution to the problem. If A and B 
select different solutions (A_ selects I 
while B selects II) we can say there is no 
“agreement” between A and B. A two- 
person group discussion of a problem 
with only two solutions is certainly not 
a typical discussion situation. The illus- 
tration is given only for simplicity’s sake. 
From this oversimplified illustration we 
can draw an important generalization 
about “agreement.” When each of the 
separate solutions to a problem is select- 
ed by an equal number of participants 
there is minimum “agreement” among 
the participants. When all the partici- 
pants select the same solution “agree- 
ment” is at a maximum. 


*George Shames_ contributed suggestions 
which, while not specifically acknowledged, have 
been most useful. 


Let us now apply the generalization 
given above to a slightly more complex 
situation in which four participants, 
A, B, C, and D have discussed a problem 
with alternate solutions I and II. If two 
of the participants (A and B) select one 
solution (solution I) while the other two 
participants (C and D) select the other 
solution (solution II) we have minimum 
“agreement.” If all four participants 
(A, B, C, and D) select the same solution 
(all select either solution I or solution 
Il) “agreement” is maximal. An inter- 
mediate level of “agreement” would be 
the instance of three participants select- 
ing one solution and one participant se- 
lecting the other solution (A, B, and C 
select solution I, and individual D selects 
solution IT). 

Our 
plex if we consider a typical group 
discussion situation in which at the end 


illustration~ becomes more com- 


of the discussion the participants are 
asked to select a solution or a point of 
view from rather than 
from two as cited in our illustrations. It 
would seem easier for A and B to agree 
on a single choice when the only choices 
are I and II than to get A and B to agree 
on the same choice when three choices 
are available: 


among many 


choice I, choice II, and 
choice III. This suggests our second gen- 
eralization. Jt is more difficult to get 
participants to agree on a solution when 
there are many possible solutions than 
when only a few alternative solutions 
are possible. 

The authors recognize that in many 
instances discussion participants are not 
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but instead 


express an 


asked to select a solution 
asked to 


point of view, etc. 


may be attitude, 
We use “solution” as 
a general term to cover the various kinds 
of choices discussants may be called upon 
to make at the conclusion of a discus- 


sion. These 


‘ 


‘solutions” or choices may 
represent points on a scale (“strongly 
agree,’ “mildly agree,” “undecided,” 
etc.) or simply different categories of re- 
sponses which may not all be part of a 
single continuum (‘I don’t have enough 
information to decide,” “I favor the goal 
but object to the method,” “I give the 
proposal my strong endorsement,” etc.). 
The measure we have developed is appli- 
cable in both instances, while the cur- 
rently Statistical 


available procedures 


are useful only for the first type of 
choices. 

The two generalizations stated earlier 
may be summarized in a formula for a 
Statistic we the 


ment (IA): 


l i as 
IA — _ (s: Sn* ... | ) (1) 


where N is the total number of partici- 
pants, k is the 
solutions to the problem discussed, and 


call Index of Agree- 


number of alternative 
n represents the number of participants 
selecting each one of the solutions. The 
Index of Agreement has the mathemati- 
cal advantage of being related to the 
chi-square distribution. 

The derivation of the Index of Agree- 
ment is based upon the definition of the 
Statistic chi-square. In a situation where 
a total of N 
over k different categories 


measures are distributed 
(solutions) 
with n measures appearing in each of the 


categories, we would expect that N/k 


measures would fall into each of the 
categories by chance. That is: 
E (n) = N/k (2) 


The statistic chi-square is traditionally 
used to square the extent and signifi- 


the differences between the 


number of 


cance of 
obtained measures in each 
category (m) and the expected numbet 
in each category under an equal prob- 
ability hypothesis, or, in the notation 
used here: 

Obtained chi-square = a x) (2) 
, 3) 
In a given experimental situation the 
N and k are usually fixed so that equa- 


tion (3) can be simplified to: 


; , k3Sn?* ; 
Obtained chi-square =X — N 
(4) 
Also in such a situation the obtained 


chi-square will have a maximum value 
when all of the N measures are concen- 
trated in one of the k categories with 
zero frequencies in the remaining k—l 
categories. This maximum chi-square 
both the N 
the k and can be shown to be: 


value is a function of and 





Maximum chi-square = N(k—l) (5) 


The Index of Agreement (IA) is defined 
as the ratio of the obtained chi-square 
(with a given distribution of m) to the 
maximum chi-square possible. The equa- 
tion for IA can be obtained by dividing 
equation (4) by equation (5) which 
gives as a working formula for IA: 


Index of Agreement — 


ewe 
k—I (s — (1) 


‘The Index of Agreement has two char- 


acteristics that reflect the earlier stated 
generalizations. First, IA has a maxi- 


mum value of 1.00 when all of the par- 
ticipants select the same solution and a 
minimum value of zero when each of 
the solutions are selected by an equal 
number of participants. Second, it is a 
function of the number of alternative 
solutions available to the participants. 

The statistical significance of a single 
Index of Agreement can be evaluated 
by testing the hypothesis that the ob- 
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MEASURING 


tained measures in the categories (7) 
depart significantly from the expected 
numbers of measures in the categories 
(N/k). This hypothesis can be tested by: 


Obtained « hi-square — 
(TA) (N) (k—]) (6) 


l 


degrees of freedom and its significance 


This obtained chi-square will have k 





evaluated by the traditional table of the 
theoretical chi-square distribution. 


The significance of the difference be- 
tween two Indices of Agreement, when 
each Index is computed from a differ- 
ent sample of subjects, can be deter- 
mined from the F distribution. The sta- 
tistic*/ is defined as the ratio of two 
independent varieties both of which are 
distributed as chi-square over its degrees 
of freedom. In the usual experimental 
situation where an Index of Agreement 
in one sample (IA,) is to be compared 
to an Index of Agreement from a second 
sample (IA,) the k’s will be the same 


and only the N’s will vary. In this case: 


— (1A,) (N,) 

(IA,) (N,) (7) 
with this F ratio having k—/ and k—l 
degrees of freedom. Using equation (7) 
the numerator (IA,) (N,) should always 
(1A,) 
(N,). This ratio can be referred to the 
standard F tables* to test its significance 
with the caution that the probability 
values given in the table (5 and 1 per 
cent) must be doubled 


be larger that the denominator 


(10 and 2 per 
cent) since equation (7) is a two-tailed 
test. When the N’s of the two samples 
are equal, equation (7) reduces to: 


F = (1A,) /(IA,) (8) 
again with k—/ and k—/ degrees of free- 
dom. 


1E. F. Lindquist, Design and Analysis of 
Experiments in Psychology and Education (New 
York, 1953), Pp. 29. 

2 Lindquist, pp. 4°-44. 


THE OUTCOME OF DISCUSSION 41 


The problem of the significance of the 
difference between two Indices of Agree- 
ment from the same sample of subjects 
(for example pre and post discussion 
measures from the same group) is as vet 


unsolved because of the lack of inde- 
pendence of the two sets of category 
frequencies. 

We will now apply the Index of 


Agreement to several sets of hypotheti- 
cal data. Table I summarizes the “‘solu- 
tions” selected by members of Group A 
and Group B at the conclusion of a dis- 
cussion session. 

The Index of Agreement for Group 
A would have a value of .31 and for 
Group B the IA has a value of .og. Does 
the IA of .g1 suggest that the members 
of Group A are agreed among them- 
selves or could an IA of .31 simply have 
arisen by chance? ‘The statistical signif- 
icance of the IA of .31 can be evaluated 
by formula (6). By applying formula 
(6) we discover that the chi-square value 
is 14.08 and could be expected to arise 
by chance less than one time in a 100. 
We therefore conclude that the IA of 
.1 for Group A in all likelihood did 
not arise by chance. As is the case with 
other statistics the IA does not tell us 
what caused the agreement. This must 
be determined by factors incorporated 
into the design of an experiment. 

When we determine the significance 
of the IA of .o2 for Group B using for- 
mula (6) we find that an IA of this mag- 
nitude could be expected to arise by 


chance 80 times out of 100. In keeping 


TABLE I 








Number of Participants 
Selecting Each Solution (n) 


Group A Group B 
I 2 t 
II 10 5 
Ill 1 3 
IV - 3 
Total Number — —_ 
Participants (N) in 15 
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with usual statistical interpretation we 
would be inclined to conclude that the 
IA for Group B is not significant. 


Let us now consider the problem of 
determining the significance of the dif- 
ference between two Indices of Agree- 
ment. We may again refer to the data 
of Table I for illustrative purposes. 
Suppose that Group A and Group B 
were each subjected to a different leader- 
ship style and we wished to determine 
the effect of leadership style on agree- 
ment. The IA might well have been 
used as a means of matching groups ini- 
tially (pre discussion) on agreement. By 
determining the signifiance of the differ- 
ence between the IA of .g1 and .o2 we 
can determine whether the amount of 
agreement resulting from leadership style 
A differs from the amount growing out 
of leadership style B. By means of formu- 
la (8), since the number of participants 
in the two groups are equal, we deter- 
that 


the difference between the 


(.31 
.02) could be expected to arise by chance 


mine 


two Indices of Agreement and 


less than 1 time out of 100 (F equals 
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19.56 with 3 and g degrees of freedom). 
We therefore conclude that Group A 
demonstrated a significantly greater 
amount of agreement than did Group B. 
As yet we have not developed a satis- 
factory method of computing the signifi- 
cance of the difference between two In- 
dices of Agreement from the same sam- 
ple of subjects (pre and post discussion 
measures from the same group). A possi- 
ble approach to this problem may be 
the use of a nonparametric technique. 
Bendig has made an application of the 
Index of Agreement to data concerned 
with judgments of adjustment and 
his paper is suggestive of applications of 
the Index of Agreement to data fre- 
quently encountered in speech ratings.* 
At this time the Index of Agreement has 
certainly not been used enough in re- 
search in discussion to establish its ulti- 
mate worth. We have presented it at this 


“é 


time as one possible ‘“‘yardstick’’ for 


measuring one outcome of group dis- 
cussion. 


3A. W. Bendig, “Rater Experience and Case 
History Judgments of Adjustment,” Jour. of 
Clinical Psychology. In press. 
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fects of two leader roles on the discus- 
sion group as shown by group member 
feelings and observer impressions in re- 
gard to: 


(a) Amount of group involvement 
(active interest and absorption of mem- 
bers in the group and its problem). 

(b) The warmth and friendliness of 
the atmosphere. 

(c) Ease in making contributions. 

(d) Cooperativeness of the group. 

(e) The amount of progress made in 
terms of what could be reasonably ex- 
pected. 

(f) The manner in which the group 
got started. 

(g) The manner in which the leader 
handled the group. 


*Based upon M.A. thesis, University of Okla- 
homa, 1952, directed by John W. Keltner. 





GROUP-CENTERED AND LEADER-CENTERED 
LEADERSHIP: AN EXPERIMENTAL STUDY* 
RICHARD R. WISCHMEIER 
Burlington, Iowa 
ECOGNIZING that improved lead- (h) The quality of the leader’s con- 
ership methods may result in more tributions. 
effective group operation, this investiga- (3) To provide a rationale for ex- 
tion was concerned with testing two plaining the behavior of the group mem- 
leadership roles in discussion groups. bers under the two leader roles and 
The major objective was to study the thereby suggest some implications. 
differential effects of the group-centered The two leadership roles, as they 
and leader-centered leader roles upon were used in this study, were defined 
group member feelings and attitudes. .. gojiows: 
The specific objectives of this study were: pegined quite broadly, group-centered leader- 
(1) To determine the differential ef- ship is an approach in which the leader places 
t teal hes om ened nem value on two goals: the ultimate development 
oric 7) > oY ¢ A 4 - . P : 
fects of ee. oe oon ay P of the group’s independence and self-responsi- 
ber attitudes: bility and the release of the group’s potential 
(a) Toward discussion leadership. capacities.1 
(b) Toward the discussion leader as In the group-centered approach, the 
a contributor to the discussion. leader’s primary concern is in facilitat- 
(c) Toward the discussion leader as ing the group development. Promoting 
a person. his own ideas and directing the members 
, toward his own goals are not his con- 
(2) To determine the differential ef- 8 


cern; his attention is centered on bring- 
ing out the maximum resources of every 
group member. 

In contrast to this approach, the lead- 
er-centered leader, as perceived in this 
study, was defined as being the more di- 
rective. He is more obviously active 
than the group-centered leader and does 
not hesitate to inject his own ideas into 
the discussion and to evaluate those of 
others. He is in a dominating role.2 The 
leader-centered leader is the expert or 


1 Thomas Gordon, “Group-centered Leader- 
ship and Administration,” in Carl Rogers, 
Client Centered Therapy (New York, 1951), p. 
377- The approach used in this experiment 
was an adaptation of Gordon's concepts. 

2In effect the role described here is quite 
similar to Lippit’s “autocratic” leader. See 
Ronald Lippit and Ralph K. White, “An 
Experimental Study of Leadership and Group 
Life,” in Theodore L. Newcombe and Eugene 
Hartley, eds., Readings in Social Psychology 
(New York, 1947), pp. 315-330 
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Group-Centered 


1. Leader asks a group member to summarize 
wherever possible. 


no 


reflects member ideas 


without attempting to influence. 


Leader clarifies and 


3. Leader makes few evaluative statements. 


j. Leader is concerned with utilizing all of 


the human resources of the group and thus 
encourages all members to participate. 


5. Leader places responsibility for decisions 


about discussion procedure on the group 


as a whole. 


6. Leader seldom gives his own opinion: he 


introduces new information only when he 


feels it is necessary, i.e., the group is stalled, 
or the members do not possess the necessary 
information. 


Leader is not concerned with maintaining 


~ 


his position as leader, but attempts to dis- 
tribute the leadership functions throughout 
the group. 


8. Leader is more concerned with the group 


process than he is with the final outcome. 

g. Leader method is informal and largely non- 
directive. Group is relatively free to direct 
its own course of action. 


Leader-Centered 


1. Leader assumes full responsibility for sum- 
marizing. 
phasizing 


He summarizes periodically, em- 


those ideas which he _ considers 


most important. 


ho 


Leader interprets, rephrases and modifies a 


member's contributions to conform with 


what he considers most important. 


the 
the members and expresses approval or dis 
approval. 


3. Leader freely evaluates statements of 


{. Leader makes little apparent effort to bring 
all members into the discussion so long as 
he feels that the group is progressing satis- 
factorily. 

5. Leader makes decisions 


most of the con 


cerning discussion procedures. 


6. Leader does not hesitate to give his 


opinions or information 


own 


7. Leader assumes responsibility for perform 


ing leadership functions and _ discourages 
group members from performing them in 


order to retain his position. 


8. Leader is more concerned with group prod 
uct. 


g. Leader method is more formal and directive 








overseer; the group-centered leader is 
the observer, clarifier, and feedback. The 
differences the 
indicated in the contrasting procedures 


between two roles are 


emploved by the two leaders.* 


I. EXPERIMENTAL PROCEDURE 
Eight groups, ranging from four to 
six members each, this 
study. Each group met for two sessions, 


were used in 


four weeks apart. In each session a prob- 


8’ The terms “leader-centered” and “group- 
centered” are ambiguous at best until they are 
defined functionally. Their use in this study 
implies no more nor less than the definitions 
suggest. 


lem in human relations was discussed 
under controlled leadership conditions. 
The group members, including the lead- 
er, remained the same for both sessions; 
only the leadership method changed. 
Four of the groups were exposed to the 
group-centered aproach in the first ses- 
sion and the other four groups were ex- 
posed to the leader-centered approach. 
These approaches were reversed in the 
second session. 

A. Problems. It was necessary to se- 
cure two problems for use in the experi- 
mental discussions, one for each round. 
Two “human relations situations” were 
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chosen because of their inherent inter- 
est and because they did not require 
specialized knowledge or advance prepa- 
ration. Recognizing that these topics 
might affect the discussions, similar prob- 
lems were used for both sessions. The 
similarity is indicated in the results. If 
the problems themselves were so differ- 
ent as to influence such factors as qual- 
ity or involvement in the discussion, 
then one would expect the differences 
observed between the groups to vary ac- 
cording to the problems they were dis- 
cussing. The results indicated that the 
differences varied according to the lead- 
ership method. 

B. Subjects. The participants in the 
experimental under- 
graduate students enrolled in a speech 


discussions were 


fundamental course at the University 
of Oklahoma. 
experiments were conducted, eighty po- 


Three weeks before the 


tential subjects were given two lectures 
their 
regular class work. The instructor ‘avoid- 


on discussion in connection with 
ed any mention of group-centered and 
The 
were then divided into small groups and 


leader-centered leadership. classes 
presented discussions in class during the 
following week. These discussions were 
observed and graded jointly by the in- 


and the 


structor and 


experimenter, 
forty subjects were selected for the ex- 
periment. Using discussion ability as 
the principal criterion the subjects were 
then divided into eight groups of ap- 
proximately equal ability. It should be 
noted for 


that each group 


sessions with the same leader, the method 


met two 
used by the leader being the prime vari- 
able. Thus, the major results of the 
study are not based upon comparisons 
between the eight groups, but rather 
between the two sessions of each group. 
Perfectly equated groups were therefore 
not necessary. 


C. Leaders. ‘Two graduate psycholo- 


15 
gy students served as leaders. They were 
given extensive training in the two types 
of leadership. This training consisted in 
study and discussion of the two leader- 
ship approaches, followed by practice 
with real groups. These practice sessions 
were continued until the experimenter 
and faculty advisers were satisfied that 
the leaders could perform both types of 
leadership with approximately equal 
skill. “The degree to which they carried 
out the two leader roles during the ex- 
determined by a 
Leadership Check List. 

D. Observers. 


servers 


periment itself was 


There were four ob- 


observers 
were graduate students with consider- 


for each session. All 
able experience in discussion work. In 
addition to their general background, 
the observers were given two weeks of 
intensive training in handling their as- 
signments. 

E. Instruments. Three equated forms 
of an Attitude Scale were designed for 
this study to determine the subjects’ atti- 
tudes toward discussion leadership. This 
scale was administered to each subject 
Form A 
two weeks before the discussions began. 
Form B following Round I, and Form C 
following Round II. 


three times: was administered 


A Sociometric Questionnaire was de- 
signed to determine the subjects feelings 
toward the leader both as the discussion 
leader and as a person. Following each 
discussion the subjects were asked to 
rank the group members, including the 
leader, on the value of their contribu- 
tion to the group, and on whom they 
would like to know better. 

The subjects’ reactions to the experi- 
mental discussions were determined by 
a Rating Scale for Group Member Feel- 
ings. ‘This enumerated sixty-two feelings 
arranged in pairs such as “felt the dis- 
cussion was very enlightening” and “felt 
the discussion was very dull and unen- 








46 SPEECH MONOGRAPHS 


lightening.” These thirty-one “pairs of 
feelings” were placed at opposite ends of 
a five point scale. The group members 
were instructed to mark their feelings 
as falling at any one of the five points 
on the scale. These scales were adminis- 
tered to each subject following each 
round of discussion. 


One week the last 
were completed, the subjects were asked 
to evaluate the two discussions in which 
they had participated. On the Compara- 
tive Evaluation Questionnaire, they were 
instructed to 


after discussions 


between the 
discussions: the one they felt was the 
better discussion, or in which they felt 
more at ease, etc. 


choose two 


The observers recorded their impres- 
sions and evaluations of the discussions 
on The Observer Scale which covered 
many of the same items that were in- 
cluded in the instruments used by the 
subjects. In addition, one observer filled 
out a Leadership Check List. The ob- 
jective of this instrument was to deter- 
mine whether the leaders conformed to 
the roles prescribed by the experiment. 
It consisted of ten contrasting procedural 
differences between the two leadership 
approaches arranged at opposite ends 
of five-point scales. 

In so far as possible each instrument 
was checked for validity and reliability. 
For example, evidence pertaining to the 
validity of the Attitude Scale was ob- 
tained by giving the scale to forty-three 
students other than those used in the 
experiment. Before taking the Attitude 
Scale they were given a mimeographed 
“frame of reference” describing one of 
the leadership approaches. Of the forty- 
three students that took the Attitude 
Scale after reading the frame of refer- 
ence, only two students had scores which 
were 
reference they used. 


inconsistent with the frame of 


Outside observers were obtained for 


seven of the discussion groups to pro- 
vide a check on the regular observers. 
These “casual” observers were not in- 
formed of the nature of the experiment 
until they had filled in the Observer's 
instrument and the Leadership Check 
List. In all cases the correlations be- 
tween their markings and the markings 
of the regular observers were sufficiently 
high to warrant reliability: above .85 in 
all instances. 

The data obtained from each instru- 
ment were treated statistically to deter- 
mine whether the differences that ap- 
peared leader-centered 
groups and the group-centered groups 
were significant, not attributable to mere 
chance or errors in sampling. For pur- 
poses of this study, “significant’”’ means 


f 


significant at the 1% 


between _ the 


level or better.* 


II. RESULTS AND CONCLUSIONS 


A. Leadership Consistency. The de- 
gree to which the two leader roles were 
carried out was indicated by the Leader- 
ship Check List. The scores recorded on 
this instrument showed a significant dif- 
ference on all items between the leader- 
centered leader and the group-centered 
leader. The group-centered leaders had 
a mean score of 4.25 
centered leaders had a 
conformity to the group-centered ap- 


and the leader- 
mean score ol 
(A score of 5.0 indicates perfect 


proach; a score of 1.0 indicates perfect 
conformity to the leader-centered ap- 
proach.) The Comparative Evaluation 
Questionnaire filled out by the subjects 
revealed that 79.5% were aware of the 
differences in the two leadership ap- 
proaches and were able to indicate the 
differences in general terms. 


B. Effect on Attitudes. The attitudes 
of the subjects toward discussion leader- 


4 The formula used for this was the “‘t’’ test 
of significance found in Allen Edwards, Statistt- 
cal Analysis (New York, 1946), p. 177. 








shi 
the 
jus 


the 


the 
as 

ric 
lea 
er 

tio 
mé 
da 

th: 
the 


10. 


the 
SiS’ 
cu 
ite 
sal 
dis 
an 


pro- 
vers. 

in- 
nent 
ver’s 
heck 

be- 
ings 
ntly 
5 in 


stru- 
eter- 
ap- 
ered 
oups 
nere 
pur- 
eans 
r.* 


> de- 
were 
ader- 
d on 
t dif- 
ader- 
fered 
had 
ader- 
‘e ol 
rfect 
ap- 
rfect 
ap- 
ation 
jects 
f the 
ap- 
» the 


tudes 
ader- 


t’’ test 
tatistt- 








GROUP AND LEADER-CENTERED LEADERSHIP 47 


ship tended to shift in the direction of 
the leadership role to which they had 
just previously been exposed. None of 
these shifts was statistically significant. 

Little difference was found between 
the subjects’ attitudes toward the leader 
as a person. However, on the Sociomet- 
ric Questionnaire all groups ranked the 
leader-centered leader significantly high- 
er in terms of the value of his contribu- 
tions to This is in 
marked contrast to the results of other 
data indicating that the subjects felt 
that the group-centered discussions were 
the better discussions. (See Item No. 
10, Table 1.) 


the discussion. 


C. Effect on Group Discussions. Both 
the observers and the subjects were con- 
sistent in rating the group-centered dis- 
cussions significantly higher on most 
items. (See Tables I and II.) It appears 
safe to conclude that the group-centered 
discussions resulted in more involvement 


and a warmer and friendlier atmosphere. 


It was also easier to make contributions 
in the group-centered discussions and 
these discussions were more cooperative. 
The leader rated 
significantly higher in terms of the value 
of his contributions. 


leader-centered was 


III. SoME IMPLICATIONS 


Many implications might be drawn 
from these conclusions. It might be said 
that the group-centered leader was some- 
what more successful in terms of the 
factors being examined in this study. 
The group-centered groups seemed more 
involved, had a friendlier atmosphere, 
were more cooperative and found it 
A large 
majority of the subjects also felt that 


easier to make contributions. 
the group-centered discussion was the 
better discussion. On the other hand, 
the subjects felt that the leader-centered 
leader did the better job. This apparent 
anomaly may be explained in two ways: 
(1) the leadership exerted by the group- 


TABLE I 


FEELINGS EXPRESSED BY SUBJECTS IN A COMPARATIVE EVALUATION OF THE Two DIsCUSSIONS 
AS SHOWN BY THE COMPARATIVE EVALUATION QUESTIONNAIRE. 











Item 
No. Feeling G I X 
(1) Felt the atmosphere was friendlier and warmer 59.0 23.5 17 : 
(2) Felt more involved 73.5 20.5 6.0 
(3) Felt the leader did the better job 29.5 56.0 14.5 
(4) Felt the group got off to a better start 56.0 35.0 9.0 
(5) Liked the way the leader handled the group better 38.0 35.0 24.0 
(6) Felt more at ease in making contributions 59.0 14.5 26.5 
(7) Felt the group was more cooperative 73-5 9.0 17.5 
(8) Felt we reached a more satisfactory solution 41.0 38.0 21.0 
(9) Felt the group made more progress in terms of what could 

be reasonably expected 50.0 38.0 12.0 
(10) Felt was the better discussion 67.5 23.5 9.0 


(11) Noticed a difference in the way the leader acted in the first 
discussion and in the way he acted in the second discussion 


(79.5% Yes and 20.5% No.) 








Key: G—Group-centered discussion 
L—Leader-centered discussion 


X—No difference, no opinion or can’t remember well enough 
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TABLE II 


MEAN Scorkgs FOR ALL Groups RECORDED JOINTLY BY OBSERVERS ON OBSERVERS SCALE. 
(Significant items only) 








Item 
No. Feeling! GC LC Mean Diff. 
(1) Felt atmosphere in the group was friendly, 
warm and permissive 4-25 3-25 1.00* 
(2) Felt group worked well together as a team 4.375 3.00 1.375° 
(3) Felt the group was predominantly cooperative 4625 3.375 125° 
(4) The group seemed very involved in the discussion 4-5 3.00 1.pe® 
(5) The participants assumed much of the responsibility 
for group progress 4-125 2.75 1.375%* 
(6) Member contributions were accepted and 
appreciated by the group 4-375 3-5 0.875,** 
(7) Felt the group members were generally 
considerate of others 4.5 RLF 1.0%* 
(8) Felt the group accepted the 
leader, for the most part 4-75 4.25 0.5,* 








*Significant at 5% level. 
**Significant at 1% level. 


Key: GC—Mean of scores for group-centered groups. 
LC—Mean of scores for leader-centered groups. 


1 The feelings recorded in this column are in every instance the most desirable feeling. A 
score of 5 indicates perfect agreement with that feeling. A score of 1 indicates perfect agreement 
with the opposite feeling. The opposite feelings do not appear here. 


centered leader was more subtle than 
that of the leader-centered leader. Thus, 
the subjects may not have been able to 
discern and evaluate the contributions of 
the group-centered leader.® (2) The struc- 
ture of the leader-centered role was more 
sugges- 
tions, talking more frequently and _ pro- 


conducive to making content 
viding more direction. These factors are 
more in line with the usual naive ex- 
pectations of what constitutes valuable 
leader contributions. The subjects did 
not seem to be aware of the effects of 
procedural suggestions, silence and non- 
direction. This would suggest that the 
group-centered leader is not likely to 
receive much appreciation or recogni- 
tion for his leadership services although 
he is likely to be more “successful” (in 
terms of involvement, cooperativeness, 
etc.) in leading his group. 

5It is noteworthy that the observers ranked 
the leader-centered leader only .5 points higher, 
whereas the subjects ranked him 2.03 points 
higher. 


For those leadership situations in which 
involvement, friendli- 


ness of atmosphere and ease in making 


( ooperatiy eness, 


contributions is desired, it would seem 
that a group-centered type of approach 
would be advantageous. If it is granted 
that these factors ordinarily make for a 
“better” discussion then it appears that 
leader training programs should be ori- 


ented toward this approach. 

Unwarranted applications of these 
findings should be avoided. Leadership 
is dependent upon a number of differ- 
the group 
goal, personality factors, etc.), and all 


ent factors (the situation, 
of them should be taken into considera- 
tion in selecting and using any leader- 
ship method or approach. Although 
the results of this study may not be defin- 
itive, it is clear that a knowledge of the 
differential effects of various leadership 
types should be helpful both in training 
leaders and in leading discussions. 
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a The three experimental studies of listenability here reported were jointly 
~ done by Kenneth A. Harwood,' Francis A. Cartier and Leo Goodman-Mala- 
muth,? under the direction of the writer. For all studies listenability was 
~~? ° . . 5; 
= defined as a characteristic of spoken language, matching the accepted usage 
en of readability as a characteristic of written language. 
42 ° . . ° i 
The experimenters first considered the possible construction of a 
ee listenability formula by which the degree of difficulty of any spoken lan- 
guage sample might be quantitatively measured and predicted. This plan 
* was rejected, however, in view of the fact that several well-tried readability 
formulas were already available. It was then decided to design a series of 
experiments to find out what would happen when the listenability of lan- 
guage samples was predicted by means of readability formulas. Two types 
eA of formulas were tested; predicted readability was compared with obtained 
ment listenability; and the effects of different speaking rates were measured at 
different levels of language difficulty. 
which These problems were chosen, and the experimental details planned, in 
ondii- the light of previous research, since a review of the literature showed that 
aking most of the related studies were concerned with readability or with indi- 
nen viduals’ listening behavior, rather than with listenability. 
roach 
anted 
for a 
, that een = ; a 
eat. I. LISTENABILITY AND READABILITY 
KENNETH A. HARWOOD 
University of Southern California 
these 
rship on , : = 7 : ; , 
GA : ITH the increasing recognition § efficient listening. By comparison, the 
iffer- ‘ : : : Nhe ; : 
of the importance of listening to design of language for optimal transmis- 
rr . ° — ° a ° ° 
OUP | the functions of day-by-day living, re- sion of information to the listener has 
d all | cent investigation of listening has been neglected. Likelihood that espe- 
idera- | tended toward demonstrating the bene- cially listenable language may be de- 
ader- | fits of training the listener for more signed has been heightened by the de- 
ough velopment of readability formulas for 
1Kenneth A. Harwood, “An Experimental F P . , ‘cae , 
defin- Comparison of Listening Comprehensibility with predicting the reading difficulty of 
f the | Reading Comprehensibility,” Ph.D. dissertation, snted 1s a 
hj University of Southern California, 1950. printed language. 
Tship 2 Francis A. Cartier, “An Experimental Study - . a . . , . 
Sian of the Effect of ‘Human Interest’ Factors on The prinicpal — of this study 
In1NS | Listenability,” Ph.D. dissertation, University of was to investigate some relationships 
Southern California, 1951. aie <i ond ae ' 
3 Ph.D. dissertation still in process. etween written and spoken language 
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of various levels of difficulty. It was 
hoped that such investigation might in- 
dicate what differences, if any, existed 
between the comprehensibility of these 
two modes of language presentation. 
Given among certain conditions, a series 
of language samples graded for readabil- 
ity and presentation of each sample as 
(a) a page for silent reading and as (b) 
a talk recorded for listening, the prob- 
lem of this study was to determine (1) 
the mode of presentation in which each 
sample was more comprehensible, (2) an 
amount by which each sample was more 
comprehensible in one mode than the 
other, (3) the mode of presentation in 
which the series was more comprehensi- 
ble, and (4) an amount by which the 
series was more comprehensible in one 
mode than in the other. 


Comprehensibility was taken to mean 
the attribute of a language sample de- 
scribed by the number of correct an- 
swers to questions about a language 
sample; and a language sample was 
taken to mean the wording of any one of 
seven stories. 

Reports of eighteen studies most close- 
ly related to this study revealed that the 
reporters were more concerned with ex- 
ploration of individual abilities than 
with the mass effects of the quality of 
language used, that results of the stud- 
ies have not been in accord, that the 
relationship between reading and listen- 
ing is apparently a rather complex one, 
and that there has been a considerable 
variety of decision on the optimal pro- 
portion of artificiality and high control 
to natural situation and low control of 
the experiments. At least in its concern 
with language more than with individ- 
ual abilities 
strike a 


and in its attempts to 


balance between artificiality 
and naturalness of situation, this study 
was distinguished from previous com- 


parisons of reading with listening. 
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I. METHOD 


Seven stories were constructed to rep- 
resent each of seven different levels of 
predicted reading comprehensibility. 
Each story was accompanied by a set of 
test questions about the information 
in it. All stories were presented aurally 
to one group of subjects and visually to 
another similar group of subjects. The 
same asked of each 
the amount of time devoted to 
each presentation and to each test was 
held constant. The mean number of 
test questions answered correctly subse- 
quent to aural presentation was com- 
pared with the mean number answered 


questions were 


group; 


correctly subsequent to visual presen- 
tation. 


Copy for each story was selected on 
these bases: that it be of general inter- 
est, that it be unidentifiable with widely 
publicized events, that it be unidentifi- 
able with any particular day or month, 
that there be low likelihood of the 
subjects’ prior encounter with broad- 
casts or printed versions of it, that it 
like might 
emanate from a radio loudspeaker yet 
read like copy prepared for silent read- 
ing, that it constitute a sense-making 


sound = reasonably what 


whole, that it be exactly 300 words 
long, that the complexity of its wording 
be representative of the midpoint of one 
of the seven ranges of difficulty pre- 
dicted by the Flesch readability formu- 
las, and that its human interest value be 
representative of the mid-point of the 
middle range 


dicted by those formulas.? 


of human interest pre- 
The stories 
were prepared from United Press Radio 
feature copy, from transcripts of radio 
commentary and reportage of the Co- 
lumbia Broadcasting System, and from 


transcripts of public addresses. 


1 Rudolf Flesch, “A New Readability Yard- 
stick,” Jour. of Applied Psychology, 32 (1948), 
221-233. 
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Fifteen five-part multiple-choice ques- 
tions for each story were constructed on 
these bases: that the number of questions 
for each story be as great as practicable, 
that their range of difficulty be as nar- 
row as possible, that the information 
necessary for correct response be as dis- 
crete as possible, that chance success of 
correct response be minimized, that they 
not be “catch” questions, that they be 
rather brief, that their phrasing be about 
as difficult as the materials of the story 
to which they applied, that they cover 
the information in the same order as 
that of its appearance in the story, that 
the order of the correct multiple-choice 
response be random, and that each of 
the presented answers seem reasonable 
in the light of its question. 

The 


duction 


voice selected for aural 


of 


repro- 
the stories that of an 
adult male speaker of average profes- 
sional speech training. Recordings of 
the stories were made with a Soundmir- 


was 


ror model BK 401 magnetic tape record- 
er and an Acoustical model BA 106 crys- 
tal microphone. Since the rate of deliv- 
ery was 150 words-per-minute, the run- 
ning time of each story was two minutes, 
and the rate each story was 
accurate to plus or minus one-half sec- 
ond per fifteen seconds of delivery. The 
volume of the recordings was calibrated 
to allow playback of all seven stories at 
the same level of loudness, and all re- 
cordings were judged to be of good 
quality. 


within 


The visual reproduction of the stories, 
questions, and answer sheet was by good 
quality mimeographing on a good grade 
of 8144 by 11-inch mimeograph bond 
paper, with pica typewriting. The read- 
ing test booklet consisted of a 
instruction sheet followed by 


covering 
a blank 
page, a page of story, a page 
The listening 
test booklet was the same except for 


of ques- 
tions, a blank page, etc. 


omission of the pages bearing the stories. 

Of the 240 subjects tested, 120 took 
the reading tests and 120 others took 
the listening tests. Readers and listeners 
were drawn from the tenth grade of the 
Compton, school 
Compton is a residential suburb of Los 
Angeles characterized by middle social 
rank and average urbanization.? Read- 
ers did not differ significantly from 
listeners in distribution of age, sex, in- 


California, system. 


telligence quotient, and reading com- 
prehension grade placement. Subjects 
were tested in six groups of about forty 
subjects per group. Testing was done 
during normal in the 


usual classrooms, and with the regular 


school hours, 
teachers present. For a test of reading, 
subjects were allowed two minutes to 
read each story and the two minutes 
immediately following to read and to 
respond to the questions. For a test of 
listening, subjects were presented with 
a two-minute recording and allowed the 
two minutes immediately following to 
read and to respond to the questions. 
Responses, in the form of circles drawn 
around numbers on the answer sheet, 
were corrected for probability of chance 
success. Differences between means were 
tested by t-ratio for significance. 


II. CONCLUSIONS 


The data of Tables I and II showed 
high face validity and appeared to war- 
rant the following statements: 

1. Language predicted to be very 
easy to read was only insignificantly 
more comprehensible when presented 
for listening than when presented for 
reading. 

2. Language predicted to be easy to 
read was only insignificantly more com- 
prehensible when presented for read- 
ing than when presented for listening. 

2 Eshref Shevsky and Marilyn Williams, The 


Social Areas of Los Angeles (Berkeley, Calif., 
1949), Pp. 161. 
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TABLE I 
COMPARISON OF READABILITY WITH LISTENABILITY OF STORIES. 
; Predicted Kind of Mean 
Story Reading Ease Presentation Score* Difference t Probability 
7 Q5-11 or Reading 10.09 = ~ g 
Very easy Listening 10.36 ae -” > % 
1 85.22 or Reading 9.84 
Easy Listening 9.79 05 12 > 05 
6 75.02 or Readin 7 
49 - = 4 9.97 > eo - 
Fairly easy Listening 9-35 ” 138 > 5 
2 65.04 or Reading 8.45 , S : 
Standard Listening 8.96 9! = > 5 
5 54-97 or Reading 9.52 ? one 
Fairly difficult Listening 8.19 33 oe = 
3 39.90 or Reading 7.18 : : 
Difficult Listening 8.RO one aa — 
4 15.08 or Reading 6.04 oa : 
Very difficult Listening 6.15 — + > 5 
*N for each mean was 120. 
TABLE II 
COMPARISON OF READABILITY WITH LISTENABILITY OF SERIES. 
Kind of Presentation Mean Score* Difference t Probability 
Reading 8.73 , 1s 07 
Listening 8.32 4 is > % 








*N for each mean was 120. 


3. Language predicted to be fairly 
easy to read was only insignificantly 
more comprehensible when presented 
for reading than when presented for lis- 
tening. 


4. Language predicted to be standard 
to read was only insignificantly more 
comprehensible when presented for lis- 
tening than when presented for reading. 


5. Language predicted to be fairly 
difficult to read was very significantly 
more comprehensible when presented 
for reading than when presented for 
listening. 


6. Language predicted to be difficult 
to read was very significantly more 
comprehensible when presented for read- 
ing than when presented for listening. 

7. Language predicted to be very 
difficult to read was only insignificantly 


more comprehensible when presented 


for listening than when presented for 
reading. 
8. ‘Taken the of 


language samples was only insignificant- 


as a whole, series 
ly more comprehensible when presented 
for reading than when presented for 
listening. 


Ill. 
following 


RELATED QUESTIONS 
The 
others, are suggestive of further study: 


questions, among 


1. Should development of a listena- 
bility formula be undertaken? Construc- 


tion of such a formula seems possible. 


Results of this study indicate that al- 
though under certain conditions a read- 
ability formula might in general pre- 
dict listening comprehensibility about as 
well as it could predict reading compre- 
hensibility, at some levels of difficulty 
listening comprehensibility might re- 
quire a prediction significantly different 
from that for reading comprehensibility. 
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2. What effects, if any, are connected 
the communication in 
which test questions are presented and 


answered? 


with mode of 
In order to respond to the 
listening test questions of this experi- 
ment, subjects had to read the printed 
test questions and answer sheet. Results 
might be altered by oral presentation 
and/or answering of questions. 

3. What are the effects, if any, of 
loudness, pitch, rate, phrasing, emphasis, 


stress, voice quality, and pronunciation 
upon listening comprehensibility as com- 
pared with the effects, if any, of size, 
shape, color, and texture of paper and 
print upon reading comprehensibility? 

4. What are the effects, if any, of the 
presence, dress, height, weight, color, 
movement, etc., of the speaker seen in 
image or in person upon listening com- 
prehensibility? 


II. LISTENABILITY AND “HUMAN INTEREST” 


FRANCIS A. CARTIER 
Headquarters, Air Force ROTC, Montgomery, Alabama 


HE need for research in listenability 

has been indicated by Vernon, who 
found that about half of Britain’s radio 
listeners seemed to comprehend and re- 
tain practically nothing from the aver- 
age informational broadcasts.‘ Similar- 
ly, Chall and Dial reported that many of 
the newscasts they studied were 
difficult for the average listener in this 
the of this 
study to clarify some of the issues in- 


too 


country.? It was purpose 


volved in the solution of this problem. 


There is a considerable literature re- 
porting research in readability, i.e., in 
the comprehensibility of written lan- 
guage, and it was felt that the close cor- 
respondence between problems of reada- 
bility and listenability might be used 
to advantage in the study of the latter. 
\ccordingly, several published formulas 


designed for determining the difficulty 


| that different styles of writing present 


to the reader were examined for possi- 
ble use in predicting the difficulty of 


1P. E. Vernon, Investigations of the Intel- 
ligibility of Educational Broadcasts, paper read 
before the British Association for the Advance- 
ment of Science, Sept. 5, 1950. 

2J. S. Chall and H. E. Dial, “Predicting 
Listener Understanding and Interest in News- 
casts,” Educational Research Bull. 27 (1948), 
141-153. 


different styles of spoken language. The 
most practical appeared to be those de- 
vised by Flesch.’ Flesch’s “Yardstick” of 
readability consists of two separate for- 
mulas: (1) “reading ease” and (2) “hu- 
man interest.” The latter formula was 
designed to measure the interest value 
of written material through the effect of 
interest on comprehension. The argu- 
ment was that everyone is interested in 
people and that a measure of the extent 
to which a language sample refers to 
people is, therefore, a measure of the 
interest value of the language sample. 
These assumptions may be tested by 
examining the effect on comprehension 
of an increase or decrease in those ele- 
ments in a sample of spoken language 
which direct 
sample's 


the 
interest” 
value. This was the procedure in the 
present study. 


bear a relation to 


presumed “human 


It is widely appreciated that compre- 
hension varies closely with interest. Ver- 
non, for example, has said, “. . . by far 
the most outstanding result of our in- 
vestigation [of the listenability of a 
number of BBC talks] is that the most 


3 Rudolf Flesch, “A New Readability Yard- 
stick,” Jour. of Applied Psychology, 32 (1948), 
221-233. 
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important factors making for [listen- 
ability] are the intelligence and educa- 
tional level of the listeners on the one 
hand, and the interest aroused by the 
content of the talk on the other.’* 

For evidence that one way of being 
interesting is to talk about people, one 
may refer to statements of any 
number of journalists and editors or to 
the study of newscasts by Harrell, Brown 
and Schramm, which stated that the 
highest comprehension scores obtained 


the 


by their subjects on the various types 
of newscast items studied were on “hu- 
man interest” items.® 


One possible quantitative measure of 
the extent to which a language sample 
deals directly with people appeared to 
be the number of “personal words.” 
This study employed the definition given 
by Flesch: “All nouns with natural gen- 
der; all pronouns except neuter pro- 
nouns; and the words people (used with 
the plural verb) and folks.” 


This study examined the applicability 
to spoken language of the “human in- 
terest” formula of Flesch’s “New Read- 
ability Yardstick.”” The formula was: 
HI = 3.635 pw + .314 ps, where H/ was 
an index of predicted readability and 
interest, pw was the percentage of “per- 
sonal words” (you, people, our, him), 
and ps was the percentage of “personal 
sentences” (sentences addressed to the 
reader, verbatim conversation, 
sation-like sentences). 


conver- 
It was apparent 
from its low weighting that ps contrib- 
uted little to the formula and, since the 
interactions of ps with pw were un- 
known, it was considered wise to test 
the effect of pw alone on listenability, 
i.e., on the comprehensibility of spoken 
language. 


4 Vernon, Investigations. 

57. W. Harrell, D. E. Brown and W. 
Schramm, “Memory in Radio News Listening,” 
Jeur. of Applied Psychology, 33 (1949), 265-274. 


The study asked six questions. 

1. What effect does varying the H/ 
score of spoken language by one or two 
steps in Flesch’s five-step scale (“Dull,” 
“Mildly interesting,” “Interesting,” 
“Highly interesting,” and “Dramatic,” 
representing H/ scores of 5, 15, 30, 50, 
and 80, respectively) have on the listen- 
ability of the language? 

2. Does such an effect, if any, vary 
according to the difficulty of the lan- 
guage as measured by the “reading ease” 
formula of the “Yardstick’’? 

3. Does such an effect, if any, vary 
with the listening ability of the subject? 

4. Does the HI score or varying the 
HI score of spoken language have a 
different effect on its listenability for 
male and female listeners? 

5. What percentage of comprehen 
sion can tenth-grade junior high school 
students be expected to obtain from 
recorded speech? 

6. Are the difficulty predictions of 
the Flesch “reading ease” formula con- 
sistent when applied to spoken lan- 
guage and when certain factors of the 
language style other than those measured 
by that formula are varied slightly? 


I. METHOD 


Three versions of each of seven stories 
were prepared so that there was a ver- 
sion with H/ 15, a version with HI 90, 
and a version with HI 50, of a story at 
each of the seven levels of difficulty® as 
predicted by the “reading ease’’ formu- 
la of Flesch’s “Yardstick.” This formula 
was: RE = 206.835 — .846 wl — 1.015 
sl, where RE was an index of predicted 
reading ease, wl was the number of 
syllables per 100 words (a measure of 
average word length), and s/ was the 


6 Qualitatively, these RE levels are described 
hy Flesch as Very easy, Easy, Fairly easy, Stand- 
ard, Fairly difficult, and Very difficult, and 
correspond to the range of styles from that of 
comic books to that of scientific journals. 
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number of sentences per 100 words (a 
measure of sentence length and hence 
of sentence complexity). No “personal 
sentences” appeared in any of the ver- 
sions. All seven stories were drawn from 
materials 


originally designed to be 


broadcast. The three versions of each 
story were 300 words in length and con- 
tained the same of sentences 


and the same number of syllables. Thus, 


number 


though the seven stories represented 
seven different RE scores, the RE score 
of each set of three versions was constant. 


A fifteen-question five-choice multiple 
choice test of comprehension was con- 
structed covering the content of each of 
the seven stories. As the ideas remained 
intact in the revised versions, a given 
test was applicable to any of the three 
versions of that story. These tests were 
mimeographed and stapled into test 
booklets covered by an_ instruction 
sheet. An answer form was also mimeo- 
graphed. 

A speaker was chosen by a group of 
experts as being very close to their idea 
of “an average trained speaker.’ This 
speaker made tape recordings of all 
twenty-one scripts at a speaking rate of 
150 words per minute, with rate con- 
trolled by check-points at fifteen-second 
intervals. Timing was accurate to within 
one-half second, and volume was careful- 
ly controlled. 


The recordings were then edited into 
three tapes of seven recordings each. The 
first tape held all the HI go versions of 
the seven stories, and the HI 15 and H/ 
50 versions were 
between the 


divided alternately 


tapes, 


third 
keeping story order the same. 


second and 


Each tape was presented to a different 
group of over 100 tenth-grade students 
in Compton, California. The presenta- 
tion followed the same pattern in all 
cases and consisted of playing the first 
story and then allowing the subjects 


two minutes (found to be quite ade- 
quate) to answer the fifteen questions 
on that story. The second story was then 
played and the subjects answered the 
questions relevant to that story, and so 
on. Groups were equated for sex, age, 
and IQ. Scores on the multiple choice 
tests were corrected for chance before 
statistical treatment of the result. 


II. CONCLUSIONS 


The results of the statistical analysis, 
using t and chi square, appeared to 
warrant the following conclusions: 


1. In general, varying the HI of the 
stories by one or two steps in Flesch’s 
five-step scale of “human interest” had 
no significant effect on the listenability 
of the stories as measured by audience 
comprehension. (See Table II). 


2. This appeared to be true regard- 
less of the difficulty level of the story as 
predicted by the “reading ease” form- 
ula, 


g. Varying the HI of the stories by 
two steps did not make them significant- 
ly less or more listenable for the poorer 
listeners when “poorer” was taken to 
mean “obtaining a score below 7.0,” or 
when “poorer” was taken to mean “ob- 
taining a score below 4.0.” 


4. None of the three levels of HJ 
was more listenable than another level 
for either male or female listeners, nor 
did varying the HI score of any story 
have any significantly different effect 
on its listenability for male subjects as 
compared to female subjects. 


5. The comprehension scores were 
considered to be quite low. One half of 
the subjects obtained scores indicating 
that they comprehended only about half 
of what they heard, while a fourth of 
the subjects were apparently incapable 
of comprehending more than a third of 
the material. Only a fourth of the sub- 
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TABLE I 
MEAN CorRECTED COMPREHENSION SCORE AND STANDARD ERROR OF THE MEAN FOR EACH STORY. 
Story 
Number RE HI Mean S.E. x5 N 
7 95 50 10.65 26 106 
30 10.36 26 120 
15 9.98 28 104 
1 85 50 10.01 32 104 
3° 9-79 31 120 
15 10.01 -30 104 
© 75 50 8.33 37 104 
30 9-35 -32 120 
15 9.52 ‘35 106 
2 65 50 9-34 -2¢ 104 
30 3.96 34 120 
15 9-52 35 105, 
5 Ae 50 8.22 -32 106 
30 8.19 31 120 
15 7-57 33 104 
3 40 50 5.51 31 106 
go 5-50 34 120 
15 4.93 .28 104 
4 15 50 5-59 ‘35 104 
30 6.15 35 120 
15 6.78 39 106 
TABLE II 
DIFFERENCFS AND CRITICAL RATIOS AMONG THE THREE VERSIONS OF EACH STORY. 
; : 5 a 7 
Story Difference diff. C.R. 
Number RE HI HI 30 HI 1s HI 30 HI 15 HI go HI 15 
7 95 50 29 57 37 39 78 1.72 
30 38 39 ‘97 
1 85 50 .22 0.00 45 44 19 0.00 
30 22 44 50 
6 75 50 1.02 1.19 49 51 2.08 2.33 
30 17 47 37 
2 65 50 38 18 45 45 84 40 
30 50 49 1a 
5 Aa 50 03 55 45 46 07 1.20 
30 2 46 1.13 
3 40 50 Ol 58 47 42 02 1.38 
go 57 45 1.27 
4 ¥ 3 50 .46 1.00 49 e2 Of 2.10 
30 63 52 1.21 








jects appeared to comprehend as much 
as two-thirds of the material. 

6. As shown Table I, each diffi- 
culty prediction of the “reading ease” 
formula appeared to be quite consistent 
for spoken language over the range of 
style variation exemplified by the three 
versions of each story. 


in 


III. 
The low comprehension scores are in 
correspondence with the findings of sev- 


DISCUSSION 


eral previous researchers and confirm 
the seriousness of the mass communica- 
tion problem in this country today. 


The “human interest’ formula of 
Flesch’s “New Readability Yardstick” 
does not appear to be useful as a pre- 
dictor of listenability. It may be that if 
the experiment had investigated even 
wider differences in HI, a significant 
effect on listenability would have been 
found, but if the formula is capable only 
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of measuring such gross differences, its 
value is questionable. However, it may 
be of value in ways not tested in this 
study, e.g., as an index of the “attrac- 
tiveness” of a language sample, and 
hence an index of its ability to keep 
a listener from “tuning out.” 
Apparently, some variation of those 
language factors which are not meas- 
ured by the “reading ease”’ formula (e.g., 
the insertion or deletion of personal 
words and the vocabulary and grammat- 
ical changes necessary to do this) does 
not affect the consistency of the RE 
formula’s prediction. It should be real- 
ized that the variations from version to 
version in this study were made as 
slight as possible while still giving the 
HI scores required, but even so, the re- 
vision of a 300-word story so as to give it 
29 more personal words than it had 


III. 


without altering its length in words and 
in number of syllables cannot. be called 
a minor change of style. The “reading 
ease” formula therefore shows promise 
of being a useful basis for research to- 
ward a listenability formula. 

However, the mere mechanical revi- 
sion of script so as to increase the per- 
centage of personal words, as was done 
answer the 


problem of writing “more personal” 


here, does not appear to 


prose. It appears that it may be the 
the 
writer which is important, and that this 


initial personalized viewpoint of 


viewpoint may incidentally result in a 
high percentage of personal words. The 
practical conclusion may be that no mat- 
ter how useful predicting formulas may 
be in their proper role of predicting, 
they are not necessarily valuable in pro- 
viding rules for the writer. 


LISTENABILITY AND RATE OF PRESENTATION 


KENNETH A. HARWOOD* 
University of Southern California 


TUDIES by Goldstein and by Nel- 

son reported that increase in rate of 
presentation was accompanied by de- 
crease in listening comprehension,’ but 
neither of these studies reported the 
significance of difference, if any, in lis- 
tenability of materials of different pre- 
dicted difficulty, and both used lan- 
guage samples of only two different 
grades of predicted difficulty. 

Accordingly, given among certain. con- 
ditions, a series of language samples 
graded for seven different levels of pre- 


*This report is made through the coopera- 
tion of Goodman-Malamuth and with the aid 
of funds granted by the University of Alabama 
Committee on Research. 

1 Harry Goldstein, Reading and Listening 
Comprehension at Various Controlled Rates 
{New York, 1940); Harold FE. Nelson, “The 
Effect of Variation of Rate on the Recall by 
Radio Listeners of ‘Straight’ Newscasts,” Speech 
Monographs, 15 (1948), 173-180. 


dicted readability, and presentation of 
the series at four different rates of de- 
livery, the problem of this study was to 
determine (1) whether listenability of 
the language samples was affected signif- 
(2) 
whether the effects of such presentation 
the different 
and incidentally, (3) whether readability 
was a consistent predictor of listenabili- 
ty at the different rates. 


icantly by rate of presentation, 


were consistent at rates, 


I. METHOD 


For the study a series of seven stories 
was constructed in such a way that the 
predicted readability of one story fell 
at the mid-point of one of each of seven 
successive ranges of reading ease de- 
scribed by Flesch and that the predicted 
human interest value of each story fell 
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at the mid-point of the middle range 
of human interest described by him.? 
Each story was 300 words long. Copy for 
the stories was selected from United 
Press Radio teletype dispatches, from 
transcripts of commentary and reportage 
of the Columbia Broadcasting System 
radio network, and from transcripts of 
public addresses. 

Each story was tape-recorded at the 
separate rates of 125 words-per-minute, 
50 words-per-minute, 175 words-per- 
minute, and 200 words-per-minute, in 
the same male voice which, in an an- 
cillary study, had been selected by ex- 
pert judgment from a group of four 
male voices as having delivery most like 
that of an average trained speaker. The 
rate of delivery was accurate to plus or 
minus one-half second for each fifteen 
seconds of delivery and to plus or minus 
one-half second in delivery of each whole 
story. The recordings were assembled 
into four tapes where each tape con- 
tained all of the stories at one rate of 
presentation and all tapes contained 
the same order of presentation of the 
stories. These recordings, which were 
thought to be of good quality, were 
calibrated for volume in order to permit 
playback of all stories at the same level 
of loudness. 


Subjects were 487 students in the 
tenth grade of the Compton, California, 
school system, who were tested in four- 
teen classes of about thirty-five subjects 
per class. Each different fourth of the 
students heard all of the stories at one 
of the four different rates of presenta- 
tion. The four groups did not differ 
significantly (chi-square.,,) from each 
other in distribution of sex. Additional- 
ly, the subjects who heard the presenta- 


tion at 200 words-per-minute and _ those 


2 Rudolf Flesch, “A New Readability Yard- 
stick,” Jour. of Applied Psychology, 32 (1948), 
222-233. 


who heard the presentation at 175 
words-per-minute did not differ signif- 
icantly (t.,,) in distribution of age, 1.Q., 
and reading comprehension grade place- 
ment. Although the subjects who heard 
the presentation at 125 words-per-min- 
ute differed significantly (t.,,) from 
each of the two other groups in one o1 
more of the attributes of age, I.Q., and 
reading comprehension grade placement, 
such differences were small enough to 
have had apparently little, if any, sub- 
stantive effect on the major findings 
of the study. 


During the testing, room temperature 
was moderate, natural lighting was ade- 
quate, noise level was extremely low 
and ventilation was good. Standardized 
oral instructions were given to all sub- 
jects. Upon query, no subject indicated 
that a presentation was not clearly au- 
dible. Immediately after the playback 
of each story, subjects were allowed 
two minutes in which to read and re- 
spond to fifteen five-part multiple-choice 
questions about the story. Responses 
were in the form of drawn 
around numbers on an answer sheet. 
After tabulation, but before other statis- 
tical treatment, these responses were 
corrected for probabilitv of chance suc- 
cess. 


circles 


II. CONCLUSIONS 


Were listenability or the language 
samples affected significantly by rate of 
presentation? 

1. In general, as had been found by 
Goldstein and by Nelson, listenability 
decreased with increase in rate of pres- 
entation. 

2. However, mean listenability at 
each of the four rates of presentation 
did not differ significantly (t.,;) from 
that at any other. 

Were effects consistent at 


rates of presentation? 


different 
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FABLE I 


RANKED COMPREHENSIBILITY OF STORIES* 














Predicted Obtained for Listening 

for 125 words 150 words 175 words 200 words 
Reading per minute per minute per minute per minute 

1 1 1 1 1 

2 2 2 2 2 

3 f 4 3 3 

3 3 { { 

5 5 i) 5 5 

6 7 7 7 7 

7 6 6 6 6 

*Rank 1 indicates greatest comprehensibility. 

TABLE II 


INTFRCORRELATION 


(RHO) oF RANKED COMPREHENSIBILITY * 








Rate of 
Presentation 


150 Words 
per minute 


125 Words per minute 1 
150 Words per minute 
175 Words per minute 
200 Words per minute 


00 


175 Words 200 Words Predicted Reading 


per minute per minute Comprehensibility 
.96 .96 .96 
96 .96 .96 
1.00 -93 
‘93 








*For each value N equals 7. 
for each value 


3. Size of the decrease in listenability 
with increase in rate of presentation 
did not appear to be a simple function 
of either predicted readability or ob- 
tained listenability. 

4. Rank-order of the 
stories at each rate of presentation was 


listenability 


highly consistent with that at every 
other. As shown in Table II, rank-order 
intercorrelations were very significantly 
(t.,,) greater than zero. 


5. However, the story whose readabil- 
ity was predicted to be fairly difficult 
found to significantly  (t.,,) 
less listenable when it was presented at 


was be 
the rate of 200 words-per-minute than 
when it was presented at the rate of 
175 words-per-minute, although neither 
the four stories more-listenable nor the 
two stories less-listenable than this one 
showed significantly (t.,,) different listen- 
ability at these two rates of presentation. 
Notwithstanding the fact that the ob- 
tained 


listenabilitv and the predicted 


Probability that rho is not greater than zero is less than .o1 


readability of this story were neither 
extremely high nor extremely low, the 
story might be considered to contain 
more technical information than any of 
the others. Apparently, at least one 
function important to listenability was 
not identified in this study. 


Was readability a consistent predic- 
tor of listenability at the different rates 
of presentation? 


6. Rank-order of the 
stories at each rate of presentation was 
highly consistent with predicted rank- 
order readability of the stories (see 
Table I). All rank-order correlations 
were very significantly (t.,,) greater than 
zero. 


listenability 


7. Within the limits and under the 
conditions studied here. readability 
might be used as a gross predictor of 
listenability if accuracy of the order of 
not more than plus or minus one rank 
of listenability within a total range of 
seven ranks is desirable. 








THE COMPARATIVE INFLUENCE ON AUDIENCE OPINION 
OF PANEL DISCUSSION AND FORMAL DEBATE* 


JAMES A. GRISSINGER 
Otterbein College 


erage there has been consider- 
able experimental research deal- 
ing with the effects of discussion and de- 
bate on audience opinion, questions still 
challenge the investigator interested in 
persuasion, conference speaking, and fo- 
rensics. It is the purpose of this study to 
supplement previous information §con- 
cerning (1) the comparative net influ- 
ence of panel discussion and formal de- 
bate on the opinion of an audience on a 
controversial public question; and (2) 
to investigate the difference in the pat- 
tern of opinion change registered dur- 
ing the course of panel discussion and 
formal debate. 


I. EXPERIMENTAL PROCEDURE 


In planning and preparing the panel 
discussion and debate for use as stimuli 
in the experimental situation, as many 
variables as possible were controlled. 
First, four persons with training and 
experience in both discussion and de- 
bate were chosen. All were thoroughly 
familiar with the topics to be discussed, 
“Should We Press Now for World Fed- 
eral Government?” and debated, and 
“Resolved: That We Should Press Now 
for World Federal Government.” The 
four participants, two students and two 
members of the speech department staff, 
met with an experienced moderator for 
a preliminary session at which they 
agreed upon an agenda to be followed 
while recording the panel discussion. 


*Based upon M.A. thesis, The Ohio State 
University, 1949, directed by William E. Utter- 
back. 


The participants, who held widely vary- 
ing views on the subject, then met with 
the -moderator a the 
unrehearsed panel discussion was record- 


week later and 
ed on phonograph records. The entire 
Disc 


recordings were used to insure identical 


session ran thirty-four minutes. 


' 


— 


performances before all audiences, and ' 


to make possible stenographic trans 


cription and subsequent analysis—a 


necessary phase of this study. 


The phonograph records were trans- 
cribed, and from the transcripts, affirm 
ative and negative debate briefs were 
assembled. In preparing these detailed 
briefs, great care was taken to include 
only material used in the recorded dis- 
cussion. Arguments, examples, facts, 
opinions, and testimony were lifted di- 
rectly from the discussion transcripts. No 
Naturally, 
some difficulty was encountered in the 


other material was included. 


preparation of the briefs since the dis- 
cussion represented four varying opin- 
ions, while the debate was to present 


only two viewpoints. Fortunately, the 


opinions of the panel members generally 


fitted into either the pro or the con 
brief. 

A comparison of the briefs with the 
opinions expressed during the discussion 
made it possible to choose for the de- 
bate recording the two members of the 
panel whose points of view most clearly 
forth in the 
To make the debate more clear- 


coincided with those set 
briefs. 
cut and to save valuable time, the mod- 
erator of the discussion (who served as 


chairman of the debate) prepared ai 


in 


in 
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short, expository speech. This material, 
too, was drawn directly from the dis- 
cussion transcript. While this procedure 
was somewhat at variance with the usual 
two-man debate technique, it served to 
relieve the affirmative speaker of the 
necessity of spending much of his time 
explaining what was already agreed up- 
The 
two-man debate was used rather than 


on in the debate and discussion. 


other styles because of the difficulty of 
crowding four constructive and four re- 
buttal speeches into thirty-five minutes, 
the allotted testing time. The debaters 
were given the briefs of their cases and 
allowed several weeks to prepare 
speeches. They were cautioned to adhere 
strictly to the outlines and to use no 
new material. 

Then the debate was recorded on 
phonograph records, each speaker talk- 
ing from his brief and using no script. 
The expository statement by the chair- 
man was eight minutes long, the afhrma- 
tive constructive speech, six minutes, the 
negative argument thirteen minutes, and 
the afhrmative minutes. 


Thus, both the affirmative and negative 


rebuttal seven 


speakers had the same amount of time 
in which to effect persuasion. 
Finally, a transcript was made from 


the debate recording. The two tran- 
scripts were then used in deciding when 
during the two presentations opinions 


would be obtained. 


Because of the nature of discussion, 
uncontrolled. 
For example, in this study it is not 
possible to equate the intensity or im- 
pact of the debate and discussion. De- 
spite the fact that both contained es- 


several variables remain 


sentially the same arguments, the phras- 
ing, language, inflection patterns, and 
emphases may have varied. And in the 
discussion, it was not practicable to con- 
troi the amount of time given to various 
points of view. 
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As a partial solution of this problem, 
seven senior faculty members, thorough- 
ly familiar with debate and discussion, 
were asked to judge the quality of the 
presentations by marking a horizontal 
scale under which appeared the phrases 
“not a debate (or discussion) at all,” 
“fair,” “average,” “good,” and “one of 
the best I’ve heard.” After the ratings 
had been made on the continuum, each 
rating was given a value of from one to 
eleven depending on its scale position. 
the seven discus- 
sion ratings was 8.57; the mean value of 
the debate ratings was 7.86. The standard 
deviations were .49 and .64, respectively. 
By applying the “t-test’”' it was found 
that the critical ratio was 1.97, which 


The mean value of 


means that the chances are go out of 
1007 that there was no significant differ- 
ence in the quality of the debate and 
discussion. 

This procedure obviously places a 
large burden for the validity of the study 
upon the judgment of the evaluators 
but until more rigorous procedures are 
developed for measuring quality, we 
must depend on such subjective judg- 
ments. The pattern of opinion change 
would not be influenced to any great 
extent by small variations in quality, in 
any case. 

Having prepared and validated the 
materials, a modern electronic measur- 
ing instrument was selected to use with 
the experimental audiences. The G. E. 
Opinion Meter is a device designed to 
measure the opinion of a group of from 
six to twenty-four persons. Each of the 
twelve auditors tested at a time in this 
study held in his hand a small dial 
connected with the indicator. 
Each of these individual dials, calibrated 


central 


from o to 100 in steps of 10, has a 
1FE. P. Lindquist, Statistical Analysis in 
Educational Research (Boston, 1938), pp. 58-59. 
2R. A. Fisher, and Yates, F., Statistical 
Tables (London, 1939), p. 26. 
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pointer which may be set at any position questions were asked, the order being 


within that range. 

Complete disagreement is indicated 
by setting the pointer at 0; varying de- 
grees of disagreement by setting it from 
© to 49; indecision or neutrality by set- 
ting it exactly at 50; varying degrees of 
favorability, from 51 to 100; and com- 
plete agreement at 100. 


After an opinion is registered, the 
central indicator, also calibrated from 
0 to 100, can be adjusted to provide the 
following information: (1) the arithmet- 
ical mean of the twelve individual dial 
settings, (2) the percentage of the twelve 
stations set at the neutral position, and 
(3) the percentage of the stations whose 
pointers are set in the favorable half of 
the dial. By subtraction, the percentage 
of stations recording unfavorable opin- 
ion can be calculated. In this way the 
Opinion Meter provides information of 
a “favorable,” ‘‘unfavorable,” and “un- 
decided” nature and also indicates the 
strength of the group opinion on a 
100-point scale. 


Assuming that the Opinion Meter 
when so used actually measures opinion, 
one may still question the reliability of 
the measurement. A special study was 
set up to determine reliability before 
the main body of the experiment was 
undertaken. 


Twelve subjects were chosen at ran- 
dom from each of ten classes in an ele- 
mentary course in public address. Each 
of the ten audiences was instructed in 
the use of the Opinion Meter and then 
asked to express its opinion on each of 
five controversial questions. (Among 
these five was “Do you favor pressing 
now for World Federal Government?” 
the topic used in this study.) The mean 
opinion of each group of twelve in re- 
sponse to each of the five questions was 
recorded. The group then took a five 
minute break, after which the same five 


changed to minimize the influence of 
memory. The subjects were not in- 
formed that they would be re-tested, 
nor was the purpose of the test disclosed. 


To find the relationship of these 
tests, the rank-difference correlation 
method was used.* For the five ques- 
tions, the correlations were .81, .84, .92, 
.93, and .g5, the last and highest being 
the correlation on the 
government” question. It is probable, 
then, that the Opinion Meter, when 


“world federal 


used in this way, is a reliable instru- 
ment for measuring opinion. 


The groups tested in the study con- 
sisted of classes in Speech 401, the be- 
ginning course in public speaking at the 
Ohio State University. this 
course is required by several colleges of 
the University and is open for election 
by any student, a wide variety of inter- 
ests and backgrounds was represented. 


Because 


The debate recording was heard by 
ten audiences totaling 245 students; the 
discussion, by 10 audiences consisting 
of 252 persons. There was an average of 
25 persons in each audience, but be- 
cause of the limited capacity of the 
Opinion Meter, only 12 in each 
ence were tested. Thus the Opinion 
Meter tested 120 persons hearing the 


debate and the 


audi- 


120 persons hearing 
discussion. 

The testing took place during the 
period from February to April, 1949. 
Each testing period was fifty minutes 
long, the first ten minutes being used to 
familiarize the groups with the Opinion 
Meter. The recordings ran thirty-four 
minutes, and the remaining time was 
used for the actual testing. 

At the beginning of each test period, 
twelve persons were selected at random 
from the class. They were instructed in 


3 Henry E. 
and Education 


Garrett, Statistics in Psychology 
(New York, 1947), PP. 344°347- 
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the general use of the Opinion Meter 
and given several “warm-up” questions. 
Then one set of readings was taken be- 
fore the records were played, and the 
mean opinion of the group, the per 
cent favorable, and the per cent unde- 
cided were recorded on the question, 
“Should we press now for World Feder- 
al Government?” A_ second, third, 
fourth, and fifth set of readings was 
taken at the end of the consideration of 
each of the following issues in the dis- 
cussion agenda: 

1. What is 
ment? 
2. Would it be preferable to the U. N.? 

3. Could we get it now? 

4. If not, would it be desirable to 
press for it now? 


World Federal Govern- 


When the debate recording was tested, 
the same basic question asked 
(“Should we press now for World Fed- 
eral Government?”’) and readings were 
taken before the records were started, 
after the speech of exposition by the 
chairman of the debate, after the affiirm- 
ative speech, after the negative speech, 
and after the affirmative rebuttal, i.e., at 
the end of the debate. 

With this technique it was possible to 
observe and record the audience opinion 
not only before and after, but during 
stimulation. 


was 


II. RESULTS OF THE EXPERIMEN1 

One of the most important sets of 
data obtained with the Opinion Meter 
was the before and after mean opinion 
readings of the twelve persons tested 
in each of the twenty groups. Since these 
readings are indications of favorable o1 
unfavorable position on a 100 point 
scale, it is possible to subtract the post- 
test from the pretest readings and ob- 
tain the net shift of opinion of each 
group (Table I). By calculating the 
mean of the shifts for the ten groups 
hearing each of the two programs, the 
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net average shift of all the audiences is 
obtained. 


The mean shift of the discussion 
groups was — 12.6 points, and of the 
debate groups, — 1.3, both sets of audi- 
ences moving toward a more unfavora- 
ble position on the question of pressing 
now for world government. Shift of 
opinion in this experiment, however, is 
not dependent upon the direction of the 
shift; so a better measure of mean shift 
would result if the algebraic sign of the 
shift were ignored. On this basis the 
mean change in the discussion groups 
is still 12.6 points (all the audiences 
having changed opinion in the same 
direction), but the mean shift of the 
audience listening to the debate becomes 
4:50. The levels of confidence are .1% 
for discussion and 1% for debate, based 
on the numerical, not the algebraic, 
differences and after measure- 
ment. The possibility that these differ- 
ences in opinion are due to chance is 


before 


1 in 1000 and 1 in 100 for discussion and 
debate, respectively. With both methods 
of presentation, then, it is highly proba- 
ble that actual changes of opinion oc- 
curred. 


To discover whether the apparent dif- 
ference in the mean shifts of the two 
types of presentations was significant, a 
“t-score” formula was employed. The 
more meaningful numerical difference 
showed a level of confidence of 5%— 
Significant and in favor of discussion 
(Table I). 

The data demonstrate that significant 
changes of opinion did occur in discus- 
sion and debate audiences as a result 
of the two types of stimulation and 
that 
greater quantitative shift 
than did debate. 


discussion caused a_ significantly 


of opinion 


Next, considering the nature of the 
shift, Table II shows the Opinion Meter 
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rABLE I 


SHIFT OF OPINION AND SIGNIFICANCE OF THE SHIFTS IN DISCUSSION 


AND DEBATE AUDIENCES AS MEASURED BY THE OPINION METER READINGS BEFORE 




















AND 
AFTER PRESENTATION. 
Panel Discussion Formal Debate 
Aud. Before After Diff. Aud. Before After Diff. 

A {1 29 —12 K 42 36 — 6 

B 64 52 —12 L 62 59 — 6 
C 53 29 —14 M 65 70 5 

D 6o 48 —_ 6S 53 58 5 

E 50 41 =—§- @ 55 60 5 

F 5! 31 —20 P 57 58 1 

G 57 44 — = 42 32 <a 

H 61 56 — 5 R 64 58 = § 

I 64 48 —16 S 52 48 _ 

J 50 40 —10 T 55 55 a 
Mean of Algebraic Diff. —i2.60 Mean of Algebraic Diff. —1.30 
Mean of Numerical Diff. 12.60 Mean of Numerical Diff. 4-50 
Level of Confid. of Level of Confid. of 

Algebraic Diff. 1% Algebraic Diff. 80% 
Level ‘of Confid. of Level of Confid. of 
Numerical Diff. 1%, Numerical Diff. 1% 
Level of Confid. of Diff. between Discussion and Debate 
Algebraic Shift =o 
Level of Confid. of Diff. between Discussion and Debate 
Numerical Shift 507 
TABLE II 
OPINION METER VOTING RECORD OF DISCUSSION AND DEBATE AUDIFNCES BEFORI 
AND AFTER PRESENTATION. 
Discussion! Debate2 

Vote Before After Diff. Before After Diff. 

Yes 60 50 —10 66 61 —* 

Undecided 25 12 —13 18 12 —> 

No 35 58 33 36 47 11 








1120 respondents in 10 Audiences (A through J). 


> 


measure of the total number of persons 
in each of the three categories (favor- 
able, unfavorable) be- 
fore and after listening to the panel 


discussion and the formal debate. 


undecided, and 


Pretest-posttest differences in the total 
number of votes occurred in all three 
categories in the case of both presenta- 
tions, but again it is necessary to test 
the significance of those changes. For 
example, there were 60 persons whose 
Opinion Meter indications were in the 


2120 respondents in 10 audiences (K through T). 


favorable half of the dial before listen- 
ing to the discussion, and 50 persons 
whose indications were in that area after- 
wards. The net loss of those favorable 
was 10 persons. To calculate the signif- 
icance of the difference in these corre- 
lated groups it is necessary to know 
how many of the original 60 persons are 
represented in the final 50. The Opinion 
Meter, with its group indications, cannot 
furnish these data unless more elaborate 


testing procedures are used. The formu- 
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la which must be applied then, in the 
absence of more information about the 
individual respondent, gives the stan- 
dard error of difference between two 
uncorrelated percentages.* A “‘t-score”’ is 
obtained from this standard error, and 
levels of confidence are again read from 
the table of “t.””. But since a correlation 
factor is not included in this formula, 
the calculated 
shows the difference to be less significant 
than it would be in actuality. This must 
be taken 


level of confidence as 


into consideration when the 


data are interpreted. The approxima- 
tion shows that the shift in the number 
of persons favorable before and after 
discussion was fairly significant (at the 
20% level of confidence as indicated in 
Table III). The difference in the num- 
ber of persons undecided was highly 
significant, and in the number of those 
voting “no,” the difference was very 
highly significant. 


Assuming that both presentations ef- 
fected meaningful shifts of votes in the 
three categories, the significance of the 
difference in the effect caused by discus- 
sion and the effect caused by debate 
was on a level of confidence of approxi- 
mately 20% for the “yes” votes, 10% 
for the “undecided” votes, and 2% for 
the “no” votes (Table III). All these 
differences show that discussion exerted 
the greater influence on voting. 

Since mean opinion readings were 
taken with the Opinion Meter not only 
before and after, but during stimulation, 
valuable data are available on the pat- 
tern of change occurring during the 
discussion and debate programs. Five 
readings were taken from each discus- 
sion audience: before the presentation, 
and after discussion of each of the four 
issues in the agenda. 

From the debate audiences, five corre- 
sponding readings of mean group opin- 


TABLE Ill 


SIGNIFICANCE, OF THE DIFFERENCE BETWEEN BEFORE AND AFTER VOTING RECORDS FOR BoTH 


DIscUSSION AND DEBATE AND 


THE SIGNIFICANCE OF THE 


DIFFERENCE BETWEEN 


THEseE Two DIFFERENCES. 








Discussion 
No Before-After* 
Yes about 20% 
Undecided about 2% 
No above 1% 


Debate Discussion- 
Before- After* Debate 
about 50°% about 20%, 


about 20% 


above 10%, 
about 20% 


o7 
about 2% 








factor. 


In those audiences tested before and 
after debate (Table II), the difference 
in the number of those favorable was 
questionable in significance (50% level 
of confidence, as Table 
III), but the significance of the differ- 
ence in the number of 


indicated in 


unde- 
cided and unfavorable was _ probably 
significant (about the 20% level of con- 
fidence). Again, it should be remembered 
that the formula calculates a 
than-actual significance. 


persons 


lower- 


4 Garrett, Statistics, pp. 218-220. 


*Only an approximation—actual level of confidence would be higher due to correlation 


ion before the debate 
began, after the speech of exposition by 
the chairman, after the first affirmative 
speech, after the negative speech, and 
after the affirmative rebuttal. By sub- 
tracting each of these readings from the 
previous one, the shift effected by each 
part of the program can be obtained 
(Table IV). 


were obtained: 


In the discussion program, the means 
of the shifts of the audiences following 
discussion of each of the four issues in 


the agenda were: .8, — 6.5, — 3.3, and 
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TABLE IV 


DIFFERENCES AND SIGNIFICANCE OF DIFFERENCES IN MEAN OPINION RECORDED AT FIVE POINTS 
DuRING DISCUSSION AND DEBATE. 








Panel Discussion Differences 


Formal Debate Differences 














Aud. 1-24 2-3b 3-4¢ 4-54 
A o — 8 — 3 —1 
B 1 — 6 —1 —H 
C of —id —% ~ 
D 6 = ei =a 
E o —7 1 —3 
F —3 —9 —e2 —b 
G re) — 6 —e2 —5 
H 2 —4 — 3 o 
I 1 — 6 —5 —b 
J 4 4 —10 —* 
Mean 

Shift 8 —6.5 —3.3 —4.3 
Level 

Confid. 30% 1% 1% 1% 


Aud. 1-2e 2-3f 3-48 4-5h 
K 5 — 4 —7 0 
LL 3 3 — 8 o 
M 1 4 — 8 8 
N 1 1 —5 8 
O —11 1 —I14 1 
3 2 oO o --1 
Q 3 —2 — 9 
R 8 o — 3 —11 
S —2 3 —18 13 
a. 1 3 —11 7 
Mean 
Shift 1.1 BY —6.5, 3.4 
Level 
Confid. 50% 40% 5% 15% 








a Shift caused by discussion of issue 1 of the agenda. 
b Shift caused by discussion of issue 2 of the agenda. 
> Shift caused by discussion of issue 3 of the agenda. 

Shift caused by discussion of issue 4 of the agenda. 


Shift caused by debate affirmative constructive speech. 


c 
d 
e Shift caused by debate speech of definition. 
f 
g 


Shift caused by debate negative speech. 
h_ Shift caused by debate affirmative rebuttal. 


— 4-3 points based on the 100 point 
scale. Since a pattern implies direction, 
the algebraic sign of the shift was re- 
tained. ““T-scores” giving the significance 
of the difference in means of 
measures were calculated and levels of 
confidence obtained. These were 30%, 
1%, 1% and .1% (Table IV), respective- 
ly, for discussion of the four issues in 
the agenda. So the discussion of issues 
two, three, and four all effected signif- 
icant negative shifts. Interestingly, the 
first issue, the only one of a purely in- 
formative character, was the only one 
not effecting opinion change. 


related 


Data for the debate presentation were 
assembled similarly: the mean changes 
effected: by the four speeches were 1.1, 
7, —6.5, and 3.4 points, which are 
significant at the following levels of 
confidence (Table IV): 50%, 40%, 5%, 
and 15%, respectively. Only the nega- 
tive speech and probably the affirmative 
rebuttal caused significant changes in 


the opinions of the audiences. The other 
speeches may or may not have done so. 


III. SUMMARY AND CONCLUSIONS 


1. The Opinion Meter is a reliable 
measuring instrument when the test and 
retest are made within a short period 
of time. 

2. Panel discussion caused a greater 
quantitative shift of opinion than did 
formal debate, and this difference was 
highly significant. 

3. All discussion sections moved to- 
ward a more unfavorable position on 
the question; only four of the ten de- 
bate groups shifted negatively. 

4. Although both presentations in- 
creased significantly the number of per- 
sons voting “no” after the program, the 
discussion had a significantly greater 
effect in increasing the number of per- 
sons voting “no.” 

5. Although there were significantly 
fewer “undecided” persons after both 
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the discussion 
caused a significantly greater number of 
persons to move out of the undecided 
category or to stay out. 

6. There were significantly fewer per- 
sons voting “yes” after the discussion 
than there the program. 
But the change in the number of “‘ves”’ 
votes following debate was not signifi- 
cant. 


discussion and debate, 


were before 


7. The presentation of each issue in 
the discussion agenda, except the first 
which defined the problem, caused high- 
ly significant negative shifts. 

8. The data suggest that the debate 
speech of definition and the two affirm- 
ative speeches caused affirmative shifts 
in the audiences, but these shifts are of 
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questionable significance. The negative 
speech caused a highly significant nega- 
tive shift. 

In answering the two specific ques- 
tions posed at the beginning of the 
study, it may be said that: (1) discussion 
has a significantly greater net influence 
than debate on the opinion of an au- 
dience on a controversial public ques- 
tion, as measured by the electronic 
Opinion Meter in this study; and (2) 
the patterns of opinion change registered 
during the discus- 


course of panel 


sion and debate were quite different, the 
discussion changing opinion steadily in 
the same direction, while the debate ap- 
parently caused alternately positive and 
negative shifts. 








SHIFT OF ATTITUDE FOLLOWING PERSUASION 
AS RELATED TO ESTIMATE OF MAJORITY ATTITUDE* 


THOMAS M. SAWYER, JR. 


University 


HE aim of much public speaking is 

to persuade or to indice a shift of 
attitude or opinion. In an attempt to 
isolate the reasons for a speaker’s suc- 
cess, many studies of persuasion have 
employed rhetorical analysis of success- 
ful speeches or speakers. Comparatively 
few studies of persuasion have employed 
the manipulation of experimentally con- 
The technical difficul- 
ties of constructing reliable measuring 


trolled variables. 


instruments, and of designing controlled 
experiments in this area, have limited 
this type of research. Consequently, some 
apparently important variables, such as 
that of majority attitudes, have not 
been fully explored. 

No study has been found which de- 
scribes the relationship between the im- 
portant variables of majority position 
and shift of attitude following persua- 
Yet Sherif 
his experiment with 


sion. has demonstrated in 


the autokinetic 
effect that in an ambiguous situation 
the individual tends to adhere to the 
group, or majority norm.’ Marple has 
also established that when the majority 
position on an attitude scale is indicated 
by the experimenter individuals will 
tend to shift their attitudes toward this 
apparent majority.’ 

This suggests the question: How far 
removed from the apparent majority 
position can a speaker appear to be and 

*Based upon Ph.D. dissertation, University 
of Michigan, 1953, directed by William S. Sattler. 

1 Muzafer Sherif, The Psychology of Social 
Norms (New York, 1936). 

2C. H. Marple, “The Comparative Suscep- 
tibility of Three Age Levels to Suggestion of 


Group Versus Expert Opinion,” Jour. of Social 
Psychology, 4 (1933), 176-186. 


of Michigan 


still be persuasive? According to Hov- 
land, Lumsdaine, and Sheffield he will 
have a better chance of creating large 
shifts of attitude if the audience is op- 
posed to him, simply because they can 
shift further than those already favor- 
able. As Schachter has found, though, 
the speaker cannot deviate too much 
from the group norm, for if he does, the 
individual in the audience is likely to ad- 
here to the majority position and re- 
ject the speaker.* It might well be that 
the argument presenting “both sides,” 
which Hovland, Lumsdaine, and Shef- 
field found most effective in persuading 
opposed listeners, was effective princi- 
pally because it makes the speaker ap- 
pear to be closer in attitude to the ma- 
jority. The “one-sided” argument might 
make him appear to deviate too much 
from the majority and thus be rejected. 
It is with the analyses of this problem 
that this study is concerned. 


I. HyYPoruHESEs 


A. General. ‘This experimental study 
was designed to test the general hy- 
pothesis that persons persuaded by an 
argument were those who perceived a 
relatively small difference between the 
attitude of the speaker and the attitude 
of the majority. This: major hypothesis 
was formally phrased thus: The extent 


3C. L. Hovland, A. A. Lumsdaine, and F. D. 
Sheffield, Experiments on Mass Communication, 
Vol. III of Studies in Social Psychology in World 
War Il, ed. S. A. Stouffer. (Princeton, N. J., 
1949). 

4Stanley Schachter, “Deviation, Rejection 
and Communication,” Jour. of Abnormal and 
Social Psychology, 46 (1951), 190-207. 
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of an auditor's favorable shift of attitude 
varies inversely with his perception of 
the difference between the speaker's at- 
titude and the attitude of the majority. 

This general hypothesis was devised 
after a pilot study had suggested that 
it might be possible to predict who 
would be persuaded by a speech. In 
(to the 
speaker) shifts of attitude came from sub- 
jects whose original attitude had been 
on the side of the mean; 
most of the “opposed” (to the speaker) 


this pilot study “favorable” 


“favorable” 


shifts came from subjects at the “op- 
posed” end of the attitude scale. It 
seemed, at first glance, that the audi- 
tor’s original attitude-distance from the 
speaker determined the directon of the 
shift. 


But this did not account for the few 
anomalies, those persons originally op- 
posed who nevertheless were persuaded, 
and those originally favorable who be- 
came opposed. The evidence accumu- 
lated by Sherif, Marple, and Schachter 
suggested that the influence of the ma- 
jority attitude would account for all 
these shifts. Persons who shifted toward 
the speaker’s extremely favorable po- 
sition were those who estimated that the 
majority was reasonably favorable, and 
thus close to the speaker. Persons who 
shifted from the speaker were 
those who estimated that the majority 


away 


was rather opposed, or far from the 
speaker. 


B. Specific. The primary specific hy- 
pothesis dealt with the expected signif- 
icant difference between estimates of 
“Majority Attitude” by those persuaded 
and those not persuaded.’ Since the 
point toward which the speaker would 
attempt to shift the audience was desig- 
nated as the zero point on the attitude 


5 Throughout this experiment a difference 
between mean scores was considered significant 
if it reached the 5% level of confidence. 


scale, and the opposite end was scored 
high, it would follow that: 

1. There will be a significantly lower 
mean score on estimates of ‘Majority 
Attitude” both (a) before the argument 
(Majority I) and (b) after the argument 
(Majority II) on the part of persons 
shifting favorably to the speaker, than 
on the part of those shifting opposed. 

The same hypothesis was framed in a 


slightly different way. This involved 
treating as a raw score the difference be- 
tween the 
speaker’s position and the subject’s esti- 
mate of the majority position. This 


difference score was labeled “Speaker 


the subject’s estimate of 


Conformity.” The smaller the score, the 
less the difference between the two esti- 
mates; the larger the score, the larger 
the difference. According to the general 
should 
perceive a small difference, persons not 
persuaded, a large difference. Therefore: 


hypothesis persons persuaded 


2. There will be a significantly small- 
er mean score on “Speaker Conformity” 
on the part of persons shifting favorably 
to the speaker, than on the part of per- 
sons shifting opposed. 


This hypothesis might also be ex- 
pressed in correlational form between 
“Speaker Conformity” and “Own Shift,” 
or the difference between an individual's 
attitude score before the argument and 
his attitude score 
Thus: 


after the argument. 

3. There will be a significant nega- 
tive correlation between “Speaker Con- 
formity” scores and “Own Shift” scores. 

The difference between an individual's 
own attitude score and his estimate of 
the majority attitude was labeled “Own 
Conformity.” Since it was _ predicted 
that the majority attitude should be in- 
fluential in determining the direction 
of attitude shift, individuals should be 
more conformist after the argument than 


before. In other words: 
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4. There will be a significantly small- 
er mean “Own Conformity” score after 
the argument than before. 

This was also put into correlational 
form, thus: 

5. There will be a significantly higher 
correlation between “Own Attitude” 
and estimate of “Majority Attitude” 
after the argument than before the ar- 
gument. 

It was also predicted that other dif- 
ferences and correlations would be non- 
significant, thus: 

6. There will be no significant mean 
difference between persons shifted fa- 
vorably and persons shifted opposed 
on estimates of the ‘“Speaker’s Attitude.” 

7. There will be no significant corre- 
lation between scores of “Speaker Agree- 
ment” (difference between individual's 
own attitude before argument and his 
estimate of speaker’s attitude afterward) 
and “Own Shift.” 

8. Neither age group (18 to 21, 21 
and older) mean attitude will be shifted 
significantly. 

g. Neither sex group mean attitude 
will be shifted significantly. 

10. No mode group (listeners, read- 
ers, televiewers) mean attitude will be 
shifted significantly. 


II]. EXPERIMENTAL DESIGN 

A. The Topic. The topic selected 
for this experiment was “Lowering the 
Voting Age to Eighteen.” This topic was 
expected to be relatively fresh to student 
audiences, and as a result it was felt that 
attitudes toward this proposal were not 
rigidly fixed and could be changed. 
Secondly, the experiment was to take 
place during an election year when in- 
terest in political events, especially vot- 
ing, was presumed to be high. 

B. The Speaker. The experimenter 
chose a student from one of his own 
classes in public speaking, a freshman 


: , ' 
engineering student. The choice was | 
based upon the speaker’s ability and in- | 


terest, but also upon the fact that he | 


the audiences before 
whom he was to speak. Consequently, 
the audiences would be forced to esti- 
mate the speaker’s attitude from what 
he said and could not rely upon previous 
friendship with him. The construction 
of the speech was left up to the speaker. 
The experimenter supplied evidence, 
advice, and criticism, but the speaker 
made final decisions about choice of 
diction, the selection and arrangement 
of arguments, the style of delivery. 


was unknown to 


C. Modes of Presentation. The speech 
was presented in three different ways: 
the final draft of the speech was dittoed 
for presentation in printed form; the 
speaker memorized this final draft for 
presentation before live audiences and 
before televiewers. The printed speech 
and the speeches before live audiences 
were presented in regular classroom 
situations. The television section of the 
experiment was presented over a closed- 
circuit, single camera setup provided by 
the University of Michigan Department 
of Speech in a special classroom. 


D. The Subjects. The subjects for 
the experiment were students in first- 
year speech classes (Speech 31 and 32) 
in the College of Literature, Science and 
Arts at the University of Michigan. The 
total sample consisted of 170 subjects: 
94 were men, 76, women; 116 were under 
21 years of age, 54 were 21 or older. Sub- 
jects received the speech in three ways: 
75 heard the live speaker, 67 read the 
printed version of the speech, 28 watched 
the telecast. Because of scheduling dif- 
ficulties volunteers were requested for 
the television section of the experiment. 
Only 28 responded, and it is doubtful 
that this 
sample. 

E. The Measuring Instrument. The 


group represents a random 
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attitude measure was constructed follow- 
ing the technique suggested by Edwards 
and Kilpatrick. Using this technique 
Edwards and Kilpatrick achieved a Gutt- 
man “coefficient of reproducibility” of 
85. No coefficient of reproducibility 
has yet been computed for the attitude 
test used in this experiment. 

The construction of this type of atti- 
tude test combines the techniques of 
Thurstone and Likert. First, 125 state- 
ments related to lowering the voting 
age were selected from magazirie articles. 
Each was printed on a separate slip of 
paper and all 125 were inserted in an 
envelope together with 11 cards lettered 
from A through K (or from 1 through 
11). One hundred subjects were asked 
to arrange these 125 statements into 11 
piles according to the strength and 
direction of the attitude expressed in 
each statement, from A, or 1 (strong- 
ly in favor) through F, or 6 (neutral) 
to K, or 11 (strongly opposed). A curve 
was now drawn for each of the 125 
statements using the intervals 1 through 
11 as base line units and the cumulative 
number of pile placements as vertical 
ordinate units 0 to 100. This results in 
an ogive, or S-shaped, curve. 

The scale value of each statement was 
determined by plotting where the curve 
crossed the 50% line. The dispersion of 
placements was determined by plotting 
where the curve crossed the 25% and 
75% lines, the greater the distance on 
the base line between these two points, 
the greater the dispersion (or Q-value) 
and the less useful the statement for an 
attitude test. The mean Q-value for the 
125 statements was 2.15 base line units. 
Statements with a larger Q-value were 
discarded, leaving 75 statements with 


/ 
scale values ranging from .3, or very 


6A. L. Edwards, and F. P. Kilpatrick, “A 
Technique for the Construction of Attitude 
Scales,” Jour. of Applied Psychology, 32 (1948), 
374-383. 


7! 


8 and 6.1 on either 
side of neutral, to 10.2, or very opposed. 

Second, to screen out statements which 
did not discriminate the remaining 75 
statements were arranged in the form 
of an attitude test with a six-point scale 
of responses: Agree, Agree, 
Mildly Agree, Mildly Disagree, Disagree, 
Strongly Disagree. This test was given to 
40 subjects. 


favorable, through 5 


Strongly 


Statements with favorable 
scale values (.3 to 5.8) were scored 1 if 
any one of the “Disagree’’ responses was 
encircled, while statements with an 
opposed scale value (6.1 to 10.2) were 
scored 1 if any one of the “Agree” re- 
sponses was encircled. The range of 
scores was from 10, or favorable, to 57, 
or opposed. 

The ten tests with the lowest total 
scores were compared statement by state- 
ment with the ten tests with the highest 
total scores. The greater the difference 
of response on a statement between the 
two groups the larger the “phi coefh- 
cient” and the better the statement dis- 
criminated. Phi coefficients were read 
from Jurgensen’s tables.* ‘The mean phi 
coefficient of the 75 statements was .702. 
Statements below 
carded. 


this mean were dis- 
The effect of the second step 
was to eliminate statements near the 
neutral point on the scale and retain 
statements at either extreme. This left 
22 statements with a mean Q-value of 
1.8 and a mean phi coefficient of .819; 10 
statements scaled from 1.8 to 3.4 (favor- 
able), followed by a gap, then 12 state- 
ments scaled from 8.0 to 9.6 (opposed). 

These 22 statements were randomly 
arranged and each was provided, with 
the same six-point scale of responses. 
The 10 favorable statements were scored 
from SA-o to SA-5; the 12 opposed state- 
ments were scored from SD-o to SA-5. 
On this finished attitude test three sets 

7C. E. Jurgensen, “Table for Determining 
Phi Coefficients,” Psychometrika, 12 (1947), 17- 
29. 
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of responses were supplied for each state- 
ment: one for the subject’s “Own Atti- 
tude,” the second for his estimate of 
“Majority Attitude,” the third for his 
estimate of “Speaker’s Attitude.” 

F. Conduct of the Experiment. The 
subjects were asked to fill out the first 
two sets of responses on one copy of the 
attitude test in their regular speech 
classes. No explanation was given. A 
week or two weeks later at the start of 
the class the printed copy of the speech 
was passed out without comment, or the 
speaker was briefly introduced with no 
attempt to create ethos tor him, or vol- 
unteers were requested for an experi- 
ment in television at a later hour. Im- 
mediately after either reading or hear- 
ing the speech, or watching it on tele- 
vision, the subjects were asked to fill 
out all three sets of responses on a sec- 
ond copy of the attitude test. 

Unfortunately, although the closed 
circuit television hookup was technical- 
ly smooth, the building in which the 
presentation took place was under re- 
pair and extraneous noise may have 
modified the results of this part of the 
experiment. 


III. ReEsuLts 
The “Own Attitude” of the 
170 subjects before the experiment was 
50.9, or slightly on the favorable side of 


mean 


After the the mean | 
“Own Attitude” was 49.6 indicating no 
persuasive effect by the speech on the 


experiment 


total sample. However, later analysis 
revealed that 102, or 60% of the sub- 
jects shifted favorably, whereas only 68, 
or 40%, shifted opposed. The difference 
between these percentages is significant 
at the 1% level (t-2.60), provided it is 
assumed in computing the S.E. of the 
percentage that the 50% should have 


shifted opposed. 

After the experiment the mean “Ma- 
jority Attitude” was 41.8, a favorable 
shift significant at the 5°% level, indi- 
cating that the subjects thought the 
(See 
Estimates of the “Speaker's 
the 


majority had shifted favorably. 
Table I.) 
Attitude” were bunched closely at 
favorable end of the scale. 

A. Analysis of 
When the 


shift” subjects and “opposed shift’ sub- 


Regression 


Effect. 
sub-groups of “favorable 
jects were analyzed, it was discovered 
that the mean “Own Attitude” of the 
“favorable shift” group had originally 
been significantly more opposed than 
that of the “opposed shift” group. Afte1 
the speech these two groups merel\ 
exchanged positions, shifting significant- 
ly in opposite directions. (See Plates I, 
II, and Table II. 
the mean “Own Attitude” of the total 


This explained why 








the scale. The mean “Majority Atti- sample had not changed, but it also sug- 
tude” was 45.3 or significantly (t-2.95, gested that any shifts in this experiment 
P-1°%) more favorable. might be due to statistical regression, 
TABLE I 
COMPARISON OF MEAN Scores OF TOTAL SAMPLE. 
N-170 
Measure Mean Measure Mean t 
Own Attitude I 50.9 Own Attitude II 49-4 0.63 
Majority Attitude I 45-3 Majority Attitude II 41.8 2.18* 
Own Conformity I 11.6 Own Conformity II 14.0 1.78 
Speaker Conformity I 26.1 Speaker Conformity II 23.1 1.87 
Own Attitude I 50.9 Majority Attitude I 45-3 2.95°* 
Own Attitude II 49.4 Majority Attitude II 41.8 3.99** 








*Significant at the 5% level. 
**Significant at the 1% level. 
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TABLE II 


COMPARISON OF MEAN SCORES OF FAVORABLE AND OpposEeD SHIFT GROUPS. 

















Favorable Shift Group N-102 








Measure Mean Measure Mean t 
Own Attitude I 53-5 Own Attitude II 46.3 2.56° 
Majority Attitude I 47-4 Majority Attitude II 40.5 say"? 
Own Conformity I 12.0 Own Conformity II 12.0 0.02 
Speaker Conformity I 27.2 Speaker Conformity II 21.7 2.46* 
Own Attitude I 53-5 Majority Attitude I 47-4 2.42° 
Own Attitude II 46.3 Majority Attitude II 40.5, 2.36* 

Opposed Shift Group N-68 
Own Attitude I 47.1. Own Attitude II 54-5 sic” 
Majority Attitude I 42.1 Majority Attitude II 44.0 0.81 
Own Conformity I 11.0 Own Conformity II 17.0 2.69* * 
Speaker Conformity I 24.5 Speaker Conformity II 25.3 0.34 
Own Attitude I 47.1 Majority Attitude I 42.1 1.74 
Own Attitude II 54.5 Majority Attitude II 44.0 sa4°° 











Favorable Shift Opposed Shift 


Measure Group Mean Group Mean t 
Own Attitude I 53-5 47-1 2.04* 
Majority Attitude I 47-4 42.1 2.30° 
Own Conformity I 12.0 11.0 0.57 
Own Attitude II 46.3 54-5 2.59* 
Majority Attitude II 40.5, 44.0 1.58 
Own Conformity II 12.0 17.0 2.38* 
Speaker's Attitude 24.1 23.6 0.18 
Speaker Conformity I 27.2 24.5 1.22 
Speaker Conformity II 21.7 25.3 1.56 
Own Shift 4. 7.3 0.12 
Majority Shift 9.1 8.5, 0.51 








“Significant at the 5% level. 
**Significant at the 1% level. 


the tendency of individuals to shift 
toward the mean of the total sample on 
a second test. 


To check on this it was decided to 
perform a ¢ test on the difference be- 
tween the second “Own Attitude” mean 
of each of these sub-groups and the first 
“Own Attitude’”’ mean of the total sam- 
ple. This is a statistically rigorous test. 
If the difference between the sub-group 
second mean and the total sample first 
mean proved to be significant it could 
be assumed that more than regression 
was involved the shift of attitude. 
These differences were not significant, 
however. The second “Own Attitude” 
mean of the favorable shift group (46.3) 
and the second “Own Attitude” mean 
of the opposed shift group (54.5) were 


in 


not significantly different from the orig- 
inal mean of the total sample (t-1.88, 
t-1.20). Thus regression cannot be ruled 
out as the cause of individual attitude 
shifts in this experiment. 


Since scores on estimates of “Majori- 
ty Attitude” were correlated with scores 
on “Own Attitude” at 0.65 before the 
argument and at 0.52 after the argument, 
regression might have caused the shifts 
in “Majority Attitude” also. To check 
on this a second test was performed. 
This involved selecting from the two 
sub-groups those subjects whose original 
“Own Attitude” scores could be matched 
precisely. Forty such subjects were found 
in each group. The mean of their origi- 
nal “Own Attitude” scores was, of 
course, the same (47.6). Estimates of 
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“Majority Attitude” by these groups, if 
shifted by regression alone, should be- 
come less different, rather than growing 
larger. This was the The 


not case. 


tude shift. This is probably the most im- 
portant finding of this study. 


B. Analysis of Specific Hypotheses. 1. 
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TABLE Ill 
COMPARISON OF MATCHED SHIFT GROUPS. 
N-40 
Favorable Shift Group 
Measure Mean Measure Mean t 
Own Attitude I 476 Own Attitude II 40.1 1.97 
Majority Attitude I 45-3 Majority Attitude II 39.0 2.34° 
Own Conformity I 9.5 Own Conformity II 8.8 0.39 
Speaker Conformity I 24.3 Speaker Conformity II 20.5 1.28 
Own Attitude I 47-6 Majority Attitude I 45-3 0.69 
Own Attitude II 40.1 Majority Attitude II 39.0 0.19 
Opposed Shift Group 
Own Attitude I 476 Own Attitude II 53.6 1.44 
Majority Attitude I 43-4 Majority Attitude II 44.3 0.34 
Own Conformity I 9.2 Own Conformity II 13.8 1.83 
Speaker Conformity I 25.2 Speaker Conformity II 24.6 0.19 
Own Attitude I 47-6 Majority Attitude I 43-4 1.21 
Own Attitude II 53-6 Majority Attitude II 14-3 2.54° 


Measure 


Own Attitude I 
Majority Attitude I 
Own Conformity I 
Own Attitude II 
Majority Attitude II 
Own Conformity II 
Speaker’s Attitude 
Speaker Conformity I 
Speaker Conformity II 
Own Shift 

Majority Shift 


Favorable Shift 


Opposed Shift 


Group Mean Group Mean t 
17-6 47.6 0.00 
45-3 43-4 0.72 
9.5 9.2 0.16 
40.1 53-6 $-37°° 
39.0 44:3 1.84 
8.8 13.8 2.09* 
25-9 24-5 0.34 
24.3 25.2 0.29 
20.5 24.6 1.48 
7.5 6.0 1.05 
8.3 6.6 1.16 








*Significant at the 5% level. 


**Sionificant at the 1°% level (meaningless in this case because of the selection of the sample.) 


dence of this reversal of the general hy- 
pothesis. 

2. “Speaker Conformity” (Hypothe- 
sis 2) had predicted that the favorable 
shift group would perceive a significant- 
ly smaller difference between their orig- 
inal estimate of the “Majority Attitude” 
and their estimate of the “Speaker's 
Attitude” than would the opposed shift 
group. This hypothesis was also reversed, 
but not significantly. The favorable 
shift group had a larger mean “Speaker 
Conformity 


score than the opposed 
shift group, but this difference was non- 
significant (Table I1). 

3. Correlation Between “Speaker Con- 
formity” and “Own Shift” (Hypothe- 
sis 3) had predicted a significant nega- 


tive correlation between “Speaker Con- 
formity” and “Own Shift.” This predic- 
tion was also reversed, but not signif- 
icantly. The correlation was positive, 
but non-significant (r-.o62). 

{. “Own Conformity” (Hypothesis 
1) had predicted that the mean “Own 
Conformity” score for the whole sample 
should be smaller after the argument. 
Instead it was larger. The difference 
between the two means, however, was 
non-significant (t-1.78). 

5. Correlation Between “Own Atti- 
tude” and “Majority Attitude” (Hypoth- 
esis 5) which had predicted a higher 
“Own Attitude” 
and “Majority Attitude,” after the ar- 


gument, 


correlation between 


was, of course, reversed also. 
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SHIFT OF ATTITUDE FOLLOWING PERSUASION 


The correlation before the argument 
afterwards it became 0.52. 
This difference is not quite significant: 
(t-1.82). 

The five remaining nuli hypotheses 
could not be rejected. ‘The difference in 
mean estimates of “Speakers Attitude” 
by the two shift groups was non-signif- 
icant (t-0.18). The correlation between 
“Speaker Agreement” and “Own Shift” 
was non-significant (r-o.11). Neither age 
group shifted its mean “Own Attitude” 
scores significantly (Younger,  t-0.70; 
Older, t-o.10). Neither sex group shifted 
its mean “Own Attitude” scores signif- 
icantly (Men, t-o.24; Women, t-0.69). 
And, finally, no mode group shifted 
Attitude” 


(Listeners, t-o.78; Readers, t-o.20; Tele- 


“Own scores significantly 


viewers, t-0.00). 


IV. Discussion 

After listening to the experimental 
speech, several observers predicted, in- 
accurately it turned out, that attitudes 
would be readily shifted in the direction 
desired by the speaker. The principal 
finding of this study suggests the rea- 


' son for their poor guesses. Since estimate 


of majority attitude shift seems to be a 
function of shifts of individual attitude, 
the the 
speech biased their judgments of the 


observers’ own reactions to 
majority reaction. 

An analysis of the scores of the total 
sample suggests that some additional 
variables were considered con- 
trolled in the course of this experiment. 


One of these uncontrolled variables may 


not or 


have been the saliency of the majority 
attitude. The wide dispersal of esti- 
mates of “Majority Attitude” combined 
with the wide dispersal of “Own Atti- 
tude,” indicates the possibility that the 
majority attitude was not salient, in 
other words, not easily perceptible. The 
fact that most of the subjects estimated 
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that the majority was more liberal than 
they were themselves may indicate that 
these people felt that college students 
ought to be liberal on most issues, but 
were uncertain about /ow liberal they 
were on this issue. If this saliency is an 
important variable, then it would seem 
desirable to retest the general hypothesis 
utilizing a topic about which there is a 
sharply defined majority attitude, per- 
haps indicated by a small dispersal of 
estimates of majority attitude. 

A second possible unconsidered and 
uncontrolled variable might be the co- 
hesiveness of the class groups. The sig- 
nificant mean difference between “Own 
Attitude” and estimates of “Majority 
Attitude” may indicate that, perceptible 
or not, the attitude of the majority had 
little value to these subjects. ‘The phras- 
ing of the directions on the attitude test 
may have contributed to low group 
cohesiveness. There it was suggested that 
the rest of the class might serve as the 
majority, and the opinion of the rest 
of the class may have seemed of small 
importance to these subjects. A repeti- 
tion of this experiment utilizing groups 
with a measurable degree of cohesiveness 
might prove fruitful. 


V. CONCLUSIONS 


the 
ment altered their mean estimate of the 


1. Persons persuaded by argu- 
majority attitude to a significantly more 
favorable position, while those not per- 
suaded did not significantly alter their 
mean estimate of the majority attitude. 
Since statistical regression does not seem 
to be the sole cause of this effect, esti- 
mates of shift of majority attitude seem 
to be a function of individual attitude 
shift. 

2. Persons not persuaded by the ar- 
gument perceived a significantly greater 
attitude distance 


between themselves 


and the majority after the argument 
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than they did originally. Persons _per- 
suaded perceived no significant change 
of this distance. 

3. Persons persuaded by the argu- 
ment were originally significantly more 
opposed to the proposition than were 
those not persuaded. 

4. The mean attitude of the subjects 
persuaded by the argument became sig- 
nificantly more favorable to the propo- 
sition, while the mean attitude of sub- 
jects not persuaded became significant- 
ly more opposed. However, statistical 
regression cannot be ruled out as a 
cause of this effect. 


5: The mean estimate of majority at- 
titude by the total sample was signif- 
icantly more favorable to the proposi- 
tion both before and after the argument, 
than the mean of individual attitudes. 

6. The mean estimate of the majority 
attitude by the total sample became sig- 
nificantly more favorable to the propo- 
sition after the argument. 

7. After the argument a significantly 
greater percentage of subjects in the 
total sample shifted attitudes favorably 
than could be expected by chance. How- 
ever, there was no significant mean shift 
of attitude scores. 





V 


C 


